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Modtaget 

TITLE: FAMILY GH 61 POLYPEPTIDES 1 2 JULI 2003 

PVS 

FIELD OF THE INVENTION 

The present invention relates to xylanase enzymes belonging to the GH 61 family or 
homology group of polypeptides. The invention further relate to polynucleotides encoding said 
5 xylanases, to nucleic add constructs comprising such polynucleotides and to expression 
vectors and recombinant host cells comprising such constructs. The invention also relate to 
processes for preparing said xylanases, to compositions comprising said xylanases and to the 
use of said xylanases in various applications. Additionally, the invention relate to dough 
compositions comprising GH 61 polypeptides and the use of GH 61 polypeptides for preparing 

10 of dough or dough premix composition used for preparing baked products. The invention also 
relates to polypeptides pariicularly u seful for improving properties of baked products and/or 
dough used for making baked products. The invention further relate to polynucleotides 
encoding said polypeptides, to nucleic acid constructs comprising such polynudeotides and to 
expression vectors and recombinant host cells comprising such constructs. The Invention also 

15 relates to processes for preparing said polypeptides and to compositions comprising said 
polypeptides. 



BACKGROUND OF THE INVENTION 

Cart}ohydrates and glycoconjugates are substrates for glycosyl transferases (GTs) 
20 and g lyooside hydrolases (GHs). The structure of glycoside hydrolases began to be solved 
starting from the 1980s. At the same time, new GH proteins were discovered and their amino 
acid sequence determined. Two main observations emerged from the new data. 1) The classi- 
cal E.C. nomenclature system for naming enzyme families was not predse enough to classify 
the increasing number of enzymes that had different strudure yet perfomned the same enzy- 
25 matic readion. 2) Enzymes related by homology could have different enzymatic activity thus 
also making the E.C. nomendature system confusing for these related enzymes. A new family 
based nomendature system was proposed by Bemard Henrissat in 1991 based on the struc- 
ture of the enzymes (Henrissat B., A classification of glycosyl hydrolases based on amino-add 
sequence similarities. Biochem. J. 280:309-316(1991): Henrissat B., Bairoch A. New families 
30 in the classification of glycosyl hydrolases based on amino- acid sequence similarities. Bio- 
chem. J. 293:781-788(1993); Henrissat B., Bairoch A. Updating the sequence-based classifi- 
cation of glycosyl hydrolases. Biochem. J. 316:695-696(1996) and Davies G.. Henrissat B. 
Strudures and mechanisms of glycosyl hydrolases. Structure 3:853-859(1995).). The classifi- 
cation of glycoside hydrolases in families based on amino add sequence similarities was intro- 
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duced because there is a dire<^ relationship t^etween sequence and folding similarities, and 
such a classification is expected to: 

(i) reflect the structural features of the enzymes, which cannot be reflected by the substrate 
5 specificity alone, 

(11) help to reveal the evolutionary relationships between the enzymes, and 
(jjj) provide a convenient tool to derive mechanistic information. 

Amino add sequences grouped by nature of their similarity to a particular GH family 

10 can give ideas as to the activity of the new hypothetical protein. Some of these amino acid se- 
quences, grouped in a GH family by homology have later been suggested to have certain en- 
zymatic activity. So. in short, grouping a new amino acid sequence in a GH family does not 
specifically indicate the exact enzymatic activity. The enzymatic activity must be demonstrated 
by an activity assay of the cloned or purified protein. If the assay is difficult determination of the 

15 proteins actual function can remain un-revealed for years. 

Publicly available information on the GH-61 family counts presently only 6 nucleotide 
sequences of unknown function. One document discloses^ however, a guess that one of these 
sequences (SwissProt sequence 014405) encode an endoglucanase enzyme (Saloheimo M., 
Nakari-Setaelae T., Tenkanen M., Penttilae M.; (1997) "cDNA cloning of a Trichoderma reesei 

20 ceUulase and demonstration of endoglucanase activity by expression in yeast."; Eur. J. Bio- 
chem. 249:584-591(1997). Wori^ by the same group conftnmed that the enzyme GH61A, when 
purified showed very weak cellulase activity. The group itself admitted that since the activity 
was three orders of magnitude lower than normal cellulases, that perhaps the cellulose was 
not the correct native substrate for the enzyme. The group also made an exhaustive study of 

25 the purified enzyme with all other known carbohydrate assays (mannanase, galactanase etc.) 
and found that the enzyme* had no activity for these substrates. The authors conclude in their 
discussion that: "It is therefore unlikely that the fungus would produce Cel61A for its endoglu- 
canase activity when it is already producing more efficient endoglucanses.... It is possible that 
both TrCel61A and AbCel61A are active against specific parts of more complex natural cellu- 

30 losic substrate. However, further studies are needed to reveal the function of the glycoside hy- 
drolase 61 enzymes." (page 6505). 

Presently, the web-site of CAZY (http://afmb.cnrs-mrs.fr/CAZY/) lists the GH-61 familiy 
as unclassified, meaning that properties like mechanism, catalytic nucleophile/base, catalytic 
proton donors, and 3-D structure are not known for enzymes belonging to this family. Here the 

35 only listed known activity is endoglucanase activity. 

Despite extensive screening of Trichoderma reesei recombinant yeast libraries for oel- 
Mases and recovering many other cellulases from other GH families we have not unambigu- 
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ously identified any endoglucanase belonging to the GH-61 family from Trichoderma reesei 
thus also indicating that GH61. if that family does include celiulases> have only very weak ac- 
tivity and thus cannot be detected by the normally very sensistive recombinant yeast activity 
screening. Hence, the present 6 publicly disclosed nucleotide sequences belonging the GH-61 
5 family are either unknown open reading frames sharing homology to SwissProt sequence 
014405 or sequences cloned based on purification and sequencing of a cellulose induced 
gene (Isolation and characterization of a cellulose-growth-specific gene from Agaricus bis- 
porus;';Gene 119:183-190(1992). and besides the Cel61A protein mentioned above, there Is in 
the art no knowledge of the function and properties of any protein or peptide belonging to the 
10 GH-61 familily nor have any enzyme and/or its function been reliably demonstrated. 

SUMMARY OF THE INVENTION 

Due to our efforts searching for new carbohydrases. we have now for the first time 
discovered xylanase enzymes belonging to the GH-61 family. 

Hence, in a first aspect the Invention provides an isolated GH-61 polypeptide having 
15 xylanase activity. 

In further aspects the invention provides a polynucleotide encoding the xylanase of the 
invention; a nucleic add construct comprising the polynucleotide encoding the xylanase, 
operably linked to one or more control sequences that direct the production of the xylanase a 
suitable host; a recombinant expression vector comprising the nucleic acid construct of the 
20 Invention and to a recombinant host cell comprising the nucleic acid construct of the invention. 

In a still further aspect the invention provides a composition comprising the xylanase 
polypeptide. 

In a still further aspect the invention provides a transgenic plant comprising a 
nucleotide sequence of the invention and capable of expressing the xylanase of the invention. 
25 In still further aspects the invention provides methods for producing a xylanase 

including a method comprising: 

(a) cultivating a strain, which in its wild-type form is capable of producing the xylanase, to 
produce the xylanase; and 

(b) recovering the xylanase 

30 

and a method comprising: 

(a) cultivating a recombinant host cell of the invention under conditions conducive for 
production of the xylanase; and 

(b) recovering the xylanase. 

35 
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in a still further aspect the invention relates to a method for hydrolysing xytan 
connprising contacting a xylan substrate with a xylanase of the invention. 

In a still further aspect the invention relates to use of a xylanase of the invention for 
preparing a baked food product and use of a xylanase of the Invention for improving sensory 
5 softness of a baked food product. 

BRIEF DESCRIPTION OF DRAWINGS 

No drawings 

DETAILED DESCRIPTION OF THE INVENTION 
Definitions 

10 The term "xylanase" as used herein is to be understood as a polypeptide capable of 

hydrolysing xylan substrate. The xylan may be purified from several different sources for which 
the degree of acetylation and characteristics of side chains will vary depending on the source 
material and the purification method. 

The term "xylanase activity" as used herein is to be understood as the ability of a xy- 

15 lanase to convert a xylan substrate into hydrolysis products. The activity is defined as the 
amount of xylan (normally given in pmol) which can be converted into product per time unit 
(normally seconds) per amount of xylanase (normally grams of xylanase) measured at speci- 
fred conditions (nomially optimal pH, temperature and solvent conditions for the xylanase). The 
concept of enzyme activity is known to the skilled person. 

20 The temi "GH" and "GH family or honnK)k)gy groups" as used herein is to be understood 

as a Glycoside Hydrolases, classified In accordance with the well established CAZY classifica- 
tion system. 

The term ''GH-ei" as used herein, is to be understood as a family of enzymes, which 
share common conserved sequence portions and foldings to be classified in family 61 of the 
25 well established CAZY GH classificatk>n system. In a preferred embodiment mature GH-61 
xylanases of the invention share the following conserved portions: 

H at position 1 , 
A or P at position 59, 
30 G at position 60. 
G at position 75, 
P or A at positton 76, 
W or F at position 100, 
F or T at position 101, 
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K or Cat position 102, 
I or V or L at position 103, 
L or I or V or M at position 130, 
P at position 1 31 » 
5 G. Xaa, Y at position 1 37-1 39 
L or V or I or M at position 140 
L or V or I or M at position 141 
R at position 142 
E or Q at positions 143-144, 
10 L or V or I at position 148 
H or Nat position 149 
Cat position 163 and 
P, G and P at position 209-21 1 . 

15 In the present context "Xaa" means any amino add. Also in the present context the 

numbering system for the residues is based on SEQ ID NO:2 wherein the N terminal has been 
removed to the first histidine of the polypeptide (there after denoted Histidlne at position 1). 
This numbering system of residues may be applied to other GH61 proteins through a multiple 
sequence alignment with SEQ ID NO: 2 starting from that first H istidine, disregarding gaps 

20 generated by the multiple sequence alignment. The relative location of Histidine-1 may thus be 
easily detenmlned by multiple sequence alignment with SEQ ID No:2. Alignment may be 
performed by using AlignX in the computer program Vector NTI ver. 7.1 (Informax inc., 7600 
Wisconsin Avenue, Suite #1100. Bethesda, MD 20814, USA). The amino acid alignment is 
created using the Clustal W algorithm (Nucleic Add Research. 22 (22): 4673-4680, 1994) and 

25 the following additional parameters: Gap opening penalty of 10» Gap extension penalty of 0.05, 
Gap separation penalty range of 8. Pairwise alignment parameters were Ktuple = 1, gap 
penalty = 3, gap length opening penalty =10, gap extension penalty = 0.1, window size = 5 
and diagonals - 5. 

The term "core sequence" as used herein is to be understood as the sequence of the 
30 mature xylanase catalytic domain excluding other functional domains such as binding domains 
et. 

The tenm "conserved portion" as used herein is to be understood as an amino add or 
amino acid subsequence contained in all GH-61 xylanases of the invention and thus a com- 
mon feature of all said xylanases. The term conserved portions is also used for nucleotides 
35 and variants thereof which by virtue of the degeneracy of the code encodes conserved amino 
add portions. 
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The term "identity" as used herein, is to be understood as the homology t>etween two 
amino acid sequences or between two nucleotide sequences.. For purposes of the present in- 
vention, the degree of identity between two amino acid sequences is determined by using 
AlignX In the program of Vector NT! ver. 7.1 (Informax inc.. 7600 Wisconsin Avenue Suite 
5 #1 100. Bethesda, MD 20814. USA). Amino acid alignment is created using the Clustal W algo- 
rithm (Nucleic Acid Research, 22 (22): 4673-4680. 1994). The homology score matrix used in 
the first case was blosum62 and in the second case, exact identity of the amino acid se- 
quences in the alignment was used. The following additional parameters were used: Gap 
opening penalty of 10, Gap extension penalty of 0.05. Gap separation penalty range of 8. Pair- 

10 wise alignment parameters were Ktuple = 1. gap penalty = 3, gap length opening penalty = 10, 
gap extension penalty = 0.1, window size = 5 and diagonals = 5. 

The degree of identity between two nucleotide sequences may be determined using the 
same algorithm and software package as described above for example with the following set- 
tings: Gap penalty of 10. and gap length penalty of 10. PainA^ise alignment parameters were 

1 5 Ktuple=3, gap penalty=3 and windows=20. 

The tenm ""fragment" as used herein about a fragment of an xylanase of the invention, is 
to be understood as a polypeptide having one or more amino acids deleted from the amino 
and/or carboxyl terminus of the amino add sequence of the xylanase, while retaining the car- 
bohydrate activation of the xylanase. 

20 The term "allelic varianr as used herein about allelic variants of a polynucleotide of the 

invention is to be understood as any of two or more alternative forms of a gene occupying the 
same chromosomal locus. Allelic variation arises naturally through mutation, and may result in 
poKymorphJsm within populations. Gene mutations can be silent (no change In the encoded 
polypeptide) or may encode polypeptides having altered amino acid sequences. An allelic 

25 variant of a polypeptide is a polypeptide encoded by an allelic variant of a gene. 

The term "modification(s)" as used herein about modified polypeptides or modified 
polynucleotides is intended to mean any chemical modification of the xylanase as well as ge- 
netic manipulation of the polynucleotide encoding the xylanase. The modification(s) can be re- 
placement(s) of the amino acid side chaln(s), substitution(s). deletion(s) and/or insertions(s) in 

30 or at the amino acid(s) of interest. 

The term "artificial variant" as used herein is to t>e understood as an modified xylanase 
capable of activating carbohydrates, which has been produced by an organism which is ex- 
pressing a modified gene as compared to the unmodified polynucleotide encoding the unmodi- 
fied xylanase The modified polynucleotide (gene), from which said variant is produced when 

35 expressed in a suitable host, is obtained through human intervention. 

The term "cDNA" as used herein is intended to mean a DNA molecule which can be 
prepared by reverse transcription from a mature, spliced. mRNA molecule derived from a eu- 
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karyotic cell. cDNA lacks the Intron sequences that are usually present in the corresponding 
genomic DNA, The initial, primary RNA transcript is a precursor to mRNA and it goes through 
a series of processing events before appearing as mature spliced mRNA. These events In- 
clude the removal of intron sequences by a process called splicing. When cDNA is derived 
5 from mRNA it therefore lacks Intron sequences. 

The term '*nucleic acid construct" as used herein is to be understood as a nucleic acid 
molecule or polynucleotide, either single- or double-stranded, which is isolated from a naturally 
occurring gene or which has been modified to contain segments of nucleic adds in a manner 
that would not othenvise exist in nature. The term nucleic acid construct is synonymous with 

10 the term 'expression cassette" when the nucleic acid construct contains the control sequences 
required for expression of a coding sequence of the present invention. 

The term "control sequence** as used herein is to be understood as nucleotide se- 
quence including all components, which are necessary or advantageous for the expression of 
a polypeptide of the present invention. Each control sequence may be native or foreign to the 

15 nucleotide sequence encoding the polypeptide. Such control sequences Include, but are not 
limited to, a leader, polyadenylation sequence, propeptide sequence, promoter, signal peptide 
sequence, and transcription terminator. At a minimum, the control sequences include a pro- 
moter, and transcriptional and translational stop signals. The control sequences may be pro- 
vided with linkers for the purpose of introducing specific restriction sites facilitating ligation of 

20 the control sequences with the coding region of the nucleotide sequence encoding a polypep- 
tide. 

The term *'operably linked" as used herein is to be understood as a configuration in 
which a control sequence is appropriately placed at a positton relative to a nucleotide se- 
quence encoding an xylanase such that the control sequence directs the expression of the xy- 
25 lanase. 

The term "coding sequence'* as used herein is to be understood as a nucleotide se- 
quence, which directly specifies the amino acid sequence of the xylanase. The boundaries of 
the coding sequence are generally determined by an open reading frame, which usually begins 
with the ATG start codon. The coding sequence typically includes DNA. cDNA. and recombi- 
30 nant nucleotkie sequences. 

The term "expression" as used herein is to be understood as including any step in- 
volved in the production of the xylanase including, but not limited to, transcription, post- 
transcriptional modification, translation, post-translational modification, and secretion. 

The term "expression vector" as used herein Is to be understood as a polynucleotide 
35 molecule, linear or circular, that comprises a segment encoding an xylanase of the invention, 
and which is operably linked to additional segments that provide for its transcription. 
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The term "host cell'* as used herein. Includes any cell type which is susceptible to trans- 
fomnation with a nudeic acid construct. 

The terms "polynucleotide probe", '^hybridization" as well as the various stringency con- 
ditions are defined infm, 

5 GH-61 xylanases 

The G H-61 xylanases of the Invention all share the characteristics described supra. 
Preferably, the mature GH-61 xylanase comprise in its naturally occurring fonn the conserved 
portions of 



10 a HIsitldine (H) at position 1 , 

A or P at position 59, 

G at position 60, 

G at position 75, 

P or A at position 76. 
15 W or Fat position 100, 

For Tat position 101, 

K or Cat position 102. 

I or V or L at position 1 03, 

L or I or V or M at position 1 30, 
20 Pat position 131, 

G. Xaa and Y at position 137-139 

L or V or 1 or M at position 140, 

L or V or I or M at position 141, 

Rat position 142, 
25 E or Q at positions 143-144. 

L or V or I at position 148, 

H or Nat position 149, 

C at position 163 and 

P and G and P at position 209-21 1 , 
30 when adopting the above mentioned numbering system. 

In particular the xylanase is an isolated xylanase meaning that a preparation of the xy- 
lanase contains at the most 90% by weight of other polypeptide material with which it may be 

natively associated (lower percentages of other polypeptide material are preferred, e.g. at the 

most 80% by weight, at the most 60% by weight, at the most 50% by weight, at the most 40% 
35 at the most 30% by weight, at the most 20% by weight, at the most 10% by weight, at the most 

9% by weight .at the most 8% by weight, at the most 6% by weight, at the most 5% by weight, 
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at the most 4% at the most 3% by weight, at the most 2% by weight, at the most 1% by weight 
and at the most i4% by weight). Thus, it is preferred that the isolated xylanase is at least 92% 
pure, i.e. that the xylanase constitutes at least 92% by weight of the total polypeptide material 
present in the preparation, and higher percentages are preferred such as at least 94% pure« at 
5 least 95% pure, at least 96% pure, at least 96% pure, at least 97% pure, at least 98% pure, at 
least 99%, and at the most 99.5% pure. In particular, it is preferred that the xylanases dis- 
closed herein are in "essentially pure form". I.e. that the xylanase preparation is essentially free 
of other polypeptide material with which It is natively associated. This can be accomplished, for 
example, by preparing the xylanase by means of well-known recombinant methods. 
10 The xylanase of the invention may be synthetically made, naturally occuning or a com- 

bination thereof. In a particular embodiment the xylanase of the Invention may be obtained 
from a microorganism such as a prokaryotic cell, an archaeal cell or a eucaryotic cell. The cell 
may further have been modified by genetic engineering (cf. Sources of GH-61 xylanases, vide 
infra). 

15 In another particular embodiment, the xylanase of the inventk>n have size from about 5 

kDa to about 500 kOa, in particular from about 10 kDa to about 250 kDa, more particularly 
from about 20 kDa to 100 kDa. 

in a further embodiment, the xylanase of the invention may exhibit optimum xylan hy- 
drolysis at pH 5-9, 

20 In a still further embodiment, the xylanase of the invention may exhibit optimum xylan 

hydrolysis at a temperature within the range from about 10*^0 to about 80 **C, particularly in the 
range from about 20**C to about 60"C. 

In a still further embodiment, the xylanase of the invention may be functionally stabile 
over at a temperature of up to 100 X in particular up to 80 **C, more particularly up to SO'C. 

25 In a still further embodiment, the xylanase is specific for xylan and in a particular em- 

bodiment the xylanase comprises a binding domain, such as a xylan binding domain (XBD). 

In a particular embodiment, the xylanase exhibit at least 20%, in particular at least 
40%, such as at least 50%. in particular at least 60%. such as at least 70%, more particularly 
at least 80%, such as at least 90%. most particularly at least 95%. such as about or at least 

30 100% of the xylanase activity of either of the xylanases consisting of the amino add se- 
quences shown as the mature xylanase of SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6. The 
xylanase preferably comprises or consists of, an amino ackl sequence which has a degree of 
identity to the amino acids of the mature xylanases of SEQ ID N0:2. SEQ ID NO:4 or SEQ ID 
N0:6 of at least 65%, particularly at least 70%, e.g. at least 75%, more particularly at least 

35 80%. such as at least 85%. even more particularly at least 90%, most particularly at least 95%, 
e.g. at least 96%, such as at least 97%, and even most particularty at least 98%. such as at 
least 99% (hereinafter "homologous xylanases"), with the proviso that the xylanase belong to 
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the GH-61 family and that the xylanase preferably comprise the above mentioned conserved 
amino acid portions. 

In an interesting embodiment, the amino add sequence differs by at the most ten 
amino acids (e.g. by ten amino acids), in particular by at the most five amino acids (e.g. by five 
5 amino acids), such as by at the most four amino acids (e.g. by four amino acids), e.g. by at the 
most three amino acids (e.g. by three amino acids) from amino acids of the mature xylanases 
of SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6, observing the above mentioned proviso. In a 
particular interesting embodiment, the amino add sequence of the xylanase differs by at the 
most two amino adds (e.g. by two amino adds), such as by one amino add from amino acids 
10 of the mature xylanases OF SEQ ID NO:2, SEQ JD NO:4 or SEQ ID NO:6. 

Aligning the xylanases consisting of the amino add sequence shown as amino adds of 
the mature xylanases of SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6 with the dosest prior art 
and using the aligment method d escribed above (see the section entitled "Defimtions"). the 
closest known sequence is Cel61A a Trichoderma reeset (Hypocrea jecorina) endo*1,4< 
15 glucanaselV. 

Particularly, the xylanase comprises the amino acid sequence of the mature )^anases 
of SEQ ID N0:2, SEQ ID NO:4 and SEQ ID NO:6; an allelic variant thereof; or a fragment 
thereof that has the xylanase activity. 

In a further particular embodiment, the xylanase consists of amino adds of the mature 
20 xylanases of SEQ ID NO:2. SEQ ID NO:4 or SEQ ID NO:6 

The xylanase of the invention may be a wild-type xylanase identified and isolated from 
a natural source. 

Furthermore, the xylanase of the invention may be prepared by the DNA shuffling tech- 
nique, such as described in J.E. Ness et al. Nature Biotechnology 17, 893-896 (1999). 

25 The present invention also encompasses artifidal variants of the xylanases of the in- 

vention, where said xylanases has been mutated by adding, substituting and/or deleting one or 
more amino acids from said xylanase polypeptides while retaining the xylanase activity and the 
conserved portions. Hence, the xylanase of the invention may be an artificial variant of a GH- 
61 xylanase, wherein at least one substitution, deletion and/or insertion of an amino add has 

30 been made to the amino acid sequence in a parent GH-61 xylanase amino add sequence. In 
particular the artifidal variant comprises, in particular consists of, an amino add sequence that 
has at least one substitution, deletion and/or insertion of an amino acid as compared to amino 
acids the mature xylanases of SEQ ID NO:2. SEQ ID NO:4 of SEQ ID NO:6. In particular the 
mutation is made outside the conserved portion of the sequence or a substitution may be 

35 made within a conserved portion while maintaining the xylanase in the GH-61 family. For ex- 
ample for position 76 P may be substituted by A, while sill being conserved according to the 
definition. 
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Such artificiai variants may be constructed by standard techniques known in the art 
usually followed by screening and/or characterization. Standard techniques Includes classical 
mutagenesis, e.g. by UV irradiation of the cells or treatment of cells with chemical mutagens as 
described by Gerhardt et al. (1994); in vivo gene shuffling as described in WO 97/07205; in 
5 vitro shuffling as described by Stammer, (1994) or WO 95/17413, random mutagenesis as de- 
scribed by Eisenstadt E. et al.. (1994); PGR techniques as described by Poulsen et al. (1991); 
family shuffling as described by J.E. Ness, et al, Nature Biotechnology, vol. 17, pp. 893-896 
(1999); site-directed mutagenesis as described by Sambrook et al. (1989). Sambrook et al.. 
Molecular Cloning. A Laboratory Manual, Cold Spring Harbor, NY. A general description of nu- 
10 cleotide substitution can be found in e.g. Ford et al., 1991, Protein Expression and Purification 
2, p. 95-107. 

Such standard genetic engineering methods may also be used prepare a diversified li- 
brary of variant nucleotide sequences from the genes encoding one or more parent xylanases, 
expressing the xylanase variants in a suitable host cell and selecting a preferred variant(s). A 
15 diversified library can be established by a range of techniques known to the art (Reetz MT; 
Jaeger KE, in Btocatalysis - from Discovery to Application edited by Fessner WD, Vol. 200, 
pp. 31-57 (1999); Stemmer, Nature, vol. 370. p.389-391. 1994; Zhao and Arnold. Proc. Natl. 
Acad. Sci.. USA, vol. 94, pp. 7997-8000, 1997; or Yano et al., Proc. NaU. Acad. Sci., USA, vol. 
95, pp 5511-5515. 1998). 

20 In one embodiment of the Invention, amino acid changes (in the artificial variant as well 

as in wild-type polypeptides) are of a minor nature, that is conservative amino acid substitu- 
tions that do not significantly affect the fokJing and/or activity of the protein; small deletions, 
typically of one to about 30 amino acids; smalt amino- or carboxyl-terminal extensions, such as 
an amino-termlnal methionine resklue; a small linker peptide of up to about 20-25 residues; or 

25 a small extenston that facilitates purification by changing net charge or another function, such 
as a poly-histidine tract, an antigenic epitope or a binding domain. 

Examples of conservative substitutions are within the group of basic amino acids (ar- 
ginine. lysine and histidine), acidic amino acids (glutamic acid and aspartic add), polar amino 
acids ( glutamine a nd a sparagine), h ydrophobic a mino a dds ( leucine, i soleudne, v aline a nd 

30 methionine), aromatic amino adds (phenylalanine, tryptophan and tyrosine), and small amino 
adds (glycine, alanine, serine and threonine). Amino add substitutions which do not generally 
alter and or Impair the functton of a protein are known in the art and are descrit>ed, for exam- 
ple, by H. Neurath and R.L. Hill, 1979. In, The Proteins. Academic Press. New York. The most 
commonly occurring exchanges are Ala/Ser, Val/lle. Asp/Glu, Thr/Ser. Ala/Gly. Ala/Thr. 

35 Ser/Asn. AlaA/al, Ser/Gly. Tyr/Phe, Ala/Pro. Lys/Arg. Asp/Asn. Leu/lle, Leu/Val. Ala/Glu. and 
Asp/Gly as well as these in reverse. 
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In an interesting embodiment of the invention, the amino acid changes are of such a 
nature that the physioo-chemicai properties of the polypeptides are altered. For example, 
amino acid changes may be performed, which improve the themnal stability of the polypeptide, 
which alter the substrate specificity, which changes the phi optimum, and the like. 
5 Particularly, the number of such substitutions, deletions and/or insertions tn the xy- 

lanase of the invention, particularly those of the mature xylanases of SEQ ID NO:2. SEQ ID 
N0:4 or SEQ ID NO:6 to produce an artificial variant is at the most 10. such as at the most 9. 
e.g. at the most 8. more preferably at the most 7. e.g. at the most 6. such as at the most 5. 
most preferably at the most 4, e.g. at the most 3. such as at the most 2. in particular at the 
10 mostl. 

In a particular embodiment, the artificial variant is a variant, which has an altered, pref- 
erably reduced, immunogenicity. especially allergenicity, in animals including man as com- 
pared to a parent xylanase. The term "immunogenicity" in this context is to be understood as 
the artificial variant Invoking a an altered, in particular reduced, immunological response when 

15 administered to an animal, including intravenous, cutaneous, subcutaneous, oral and intratra- 
cheal administration. The term "immunological response" in this context means that the ad- 
ministration of the artiflcia) variant causes an alteration In the immunoglot>olin levels in the 
animal body, such as in IgE, IgG and IgM or an alteration in the cytokine level in the animal 
body. Methods for mapping immunogenic/antigenic epitopes of a protein, preparing variants 

20 with altered immunogenicity and methods for measuring an immunological response is well 
known to the art and are described e.g. in WO 92/10755. WO 00/26230. WO 00/26354 and 
WO 01/31989. 

In a further embodiment, the present inventton relates to GH-61 xylanases which are 
encoded by nucleotide sequences which hybridize under very low stringency conditions, par- 
25 ticutarly under low stringency conditions, more particulariy under medium stringency condi- 
tions, more particulariy under medium-high stringency conditions, even more particulariy under 
high stringency conditions, and most particularly under very high stringency conditions with a 
polynudeotlde probe selected from the group consisting of 

(i) the complementary strand of nucleotides 52 to 699 of SEQ ID NO:1 , 46 to 957 of SEQ 
30 ID NO:3 or 58 to 660 of SEQ ID NO:5. 

(ii) the complementary strand of the cDNA sequence contained in nucleotides 52 to 699 of 
SEQ ID N0:1 . 46 to 957 of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5 

(iii) the complementary strand of nucleotides 46 to 857 of SEQ ID NO:3. 

(iv) the complementary strand of nucleotides 52 to 300 of SEQ ID NO:1, 46 to 501 of SEQ 
35 ID NO:3 or 58 to 300 of SEQ ID NO:5 and/or 

(v) the complementary strand of nucleotides 301 to 699 of SEQ ID NO:1. 502 to 957 of 
SEQ ID NO:3 or 301 to 660 of SEQ ID NO:5. 
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(J. Sambrook. E.F. Fritsch. and T. Maniatus, 1989, Molecular Cloning, A Laboratory Manual. 
2d edition. Cold Spring Harbor, New York). 

In particular, the xylanase of the invention is encoded by a polynucleotide comprising 
5 the nucleotide sequence of nucleotides 52 to 699 of SEQ ID NO:1 , 46 to 957 of SEQ ID N0:3 
or 58 to 660 of SEQ ID NO:5 or sequences differing from 52 to 699 of SEQ ID NO:1. 46 to 957 
of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5 by virtue of the degeneracy of the genetic code. 
More particularly, the xylanase of the Invention is encoded by a polynucleotide consisting of 
nucleotide sequence of nucleotides 52 to 699 of SEQ 10 NO:1. 46 to 957 of SEQ ID NO:3 or 

10 58 to 660 of SEQ ID N0:5 or sequences differing from 52 to 699 of SEQ ID NO:1 , 46 to 957 of 
SEQ ID N0:3 or 58 to 660 of SEQ ID NO:5 by virtue of the degeneracy of the genetic code. 

The nucleotide sequences of nucleotides SEQ ID N0:1, SEQ ID NO:3 or of SEQ ID 
NO:5 or a subsequence thereof, as well as the amino acid sequences of SEQ ID NO:2, SEQ 
ID NO:4 or SEQ ID NO:6 or a fragment thereof, may be used to design a polynucleotide probe 

15 to identify and clone DNA encoding GH-61 xylanases of the Invention from strains of different 
genera or species according to methods well known in the art. In particular, such probes can 
be used for hybridization with the genomic or cDNA of the genus or species of interest, follow- 
ing standard Southern blotting procedures, in order to identify and isolate the conresponding 
gene therein. Such probes can be considerably shorter than the entire sequence, but should 

20 be at least 1 5, preferably at least 25. more preferably at least 35 nucleotides in length, such as 
at least 70 nucleotides In length. It is, however, preferred that the polynucleotide probe is at 
least 100 nucleotides in length. For example, the polynucleotide probe may be at least 200 nu- 
cleotides in length, at least 300 nucleotides in length, at least 400 nucleotides in length or at 
least 500 nucleotides In length. Even longer probes may be used, e.g., polynucleotide probes 

25 which are at least 600 nucleotides in length, at least 700 nucleotides in length, at least 800 nu- 
cleotides in length, or at least 900 nucleotides in length. Both DNA and RNA probes can be 
used. The probes are typically labeled for detecting the corresponding gene (for example, with 
32p^ 353 jji^^jj^^ or avidin). 

Thus, a genomic DNA or cDNA library prepared from such other organisms may be 
30 screened for DNA which hybridizes with the probes described above and which encodes the 
xylanases of the inventk)n. Genomic or other DNA from such other organisms may be sepa- 
rated by agarose or polyacrylamide gel electrophoresis, or other separation techniques. DNA 
from the libraries or the separated DNA may be transferred to, and immobilized, on nitrocellu- 
lose or other suitable carrier materials. In order to identify a clone or DNA which has the re- 
35 quired homology and/or identity or is homologous and/or identical with of nucleotides 52 to 699 
of SEQ ID NO:1. 46 to 957 of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5 the carrier material 
with the immobilized DNA rs used in a Southern blot. 
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For purposes of the present invention, hybridization indicates that the nudeotide se- 
quence hybridizes to a labeled polynucleotide probe which again hybridizes to the nucleotide 
sequence shown in nucleotides 52 to 699 of SEQ ID NO:1 » 46 to 957 of SEQ ID NO:3 or 58 to 
660 of SEQ ID NO:5 under very low to very high stringency conditions. Molecules to which the 
5 polynucleotide probe hybridizes under these conditions may be detected using X-ray film or by 
any other method known in the art. Whenever the term "polynucleotide probe" is used in the 
present context, it is to be understood that such a probe contains at least 15 nucleotides. 
In particular the probe comprise nucleotides encoding conserved portions of a Hisitidine (H) at 
position 1, A or P at position 59, 6 at position 60, 6 at position 75, P or A at position 76, W or 

10 F at position 100, F or T at position 101, K or C at position 102, 1 or V or L at position 103, L or 
I or V or M at position 130, P at position 131, G and Xaa and Y at position 137-139. L or V or I 
or M at position 140, L or V or I or M at position 141 . R at position 142, E or Q at positions 143- 
144, L or V or I at position 148, H or N at position 149, C at position 163 and P and G and P at 
position 209-21 1 in the mature xylanase. 

15 In an interesting embodiment, the polynucleotide probe is the complementary strand of 

nucleotides 52 to 300 of SEQ ID NO:1, 46 to 501 of SEQ ID NO:3 or 58 to 300 of SEQ ID 
NO:5. In another embodiment the polynucleotide probe is the complementary strand of nucleo- 
tides 301 to 699 of SEQ ID NO:1 , 502 to 957 of SEQ ID NO:3 or 301 to 660 of SEQ ID NO:5. 
In another Interesting embodiment, the polynucleotide probe is the complementary 

20 strand of the nucleotide sequence which encodes the polypeptide of SEQ ID NO:2, SEQ ID 
NO:4 or SEQ ID NO:6 or the mature xylanases thereof. In a further interesting embodiment, 
the polynucleotide probe is the complementary strand of SEQ ID NO:1, SEQ ID NO:3 or SEQ 
ID NO:5. In a still further interesting embodiment, the polynucleotide prot>e is the complemen- 
tary strand of the mature polypeptide coding region of SEQ ID NO:1, SEQ ID NO:3 or SEQ ID 

25 NO:5. 

For long probes of at least 100 nucleotides in length, very low to very high stringency 
conditions are defined as prehybridization and hybridization at 42*C in 5X SSPE, 1.0% SDS. 
5X Denhardt's solution, 100 pg/ml sheared and denatured salmon sperm DNA, following stan- 
dard Southern blotting procedures. Preferably, the long probes of at least 100 nucleotides do 

30 not contain nriore than 1000 nucleotides. For long probes of at least 100 nucleotides in length, 
the carrier material is finally washed three times each for 15 minutes using 2 x SSC, 0.1 % SDS 
at 42*^0 (very low stringency), preferably washed three times each for 15 minutes using 0.5 x 
SSC, 0.1% SDS at 42*C (low stringency), more preferably washed three times each for 15 
minutes using 0.2 x SSC. 0.1% SDS at 42''C (medium stringency), even more preferably 

35 washed three times each for 15 minutes using 0.2 x SSC, 0.1% SDS at 55^0 (mediunrvhigh 
stringency), most preferably washed three times each for 15 minutes using 0.1 x SSC, 0.1% 
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SDS at 60*C (high stringency), in particular washed three times each for 15 minutes using 0.1 
X SSC. 0.1% SDS at 68*C (very high stringency). 

Although not particularly preferred, it is contemplated that shorter probes, e.g. probes 
which are from about 15 to 99 nucleotides in length, such as from about 15 to about 70 nucleo- 
5 tides in length, may be also be used. For such short probes, stringency conditions are defined 
as p rehybridization, hybridization, a nd washing post-hybridization at 5 ""C to 1 0^C below the 
calculated Tm using the calculation according to Bolton and McCarthy (1962, Proceedings of 
the National Academy of Sciences USA 48:1390) In 0.9 M NaCI, 0.09 M Tris-HCI pH 7.6, 6 mM 
EDTA. 0.5% NP-40. IX Oenhardt's solution, 1 mM sodium pyrophosphate, 1 mM sodium 
10 monobasic phosphate, 0.1 mM ATP, and 0.2 nng of yeast RNA per ml following standard 
Southern blotting procedures. 

For short probes which are about 15 nucleotides to 99 nucleotides in length, the carrier 
material is washed once in 6X SCC plus 0.1% SDS for 15 minutes and twice each for 15 min- 
utes using 6X SSC at 5''C to lO'^C below the calculated Tm. 

1 5 Sources of xylanases 

The xylanase of the present Invention may be obtained from microorganisms of any 
genus. For purposes of the present invention, the term "obtained from", as used herein shall 
mean that the xylanase encoded by the nucleotide sequence is produced by a cell in which the 
nucleotide sequence Is naturally present or into which the nucleotide sequence has been 
20 inserted. 

Further xylanases obtainable from microorganisms are in a particular embodiment an 
extracellular xylanases Le. a xylanase which is secreted or otherwise exported from a 
microorganism to its surrounding medium. 

Eukarvotic sources 

25 The xylanase of the invention may be obtained from eukaryotes particulariy, plant cells 

or fungi. Particularly, the xylanase may be derived from fungi that degrade carbohydrates, 
such as cellulosic substrates. Such fungi include e.g. Ascomycota, Basidiomycota, Zygomy- 
cota or Oomycota. In particular Vertidllium tenerum. Coprinus cinerius, Diplodia gossypinna, 
Humicola insolens, Dichotomocladium hesseltinei. Pseudoplectania nigrella. Psilocybe in- 

30 quillna and Trichophaea saccata. 

Other relevant fungi may be yeasts such as a Candida, Kiuyveromyces, Pichia, Sac- 
charomyces, Schizosaccharomyces, or Yanx>wia or filamentous fungi such as an Acremonium, 
Aspergillus, Aureobasidium, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, 
Mucor, Myceliophthora, Neocallimastix, Neurospora, Paedlomyces, Penicillium, Piromyces, 

35 Schizophylium, Talaromyces, Thermoascus, Thielavia, Tolypocladium, or Trichoderma. 



15 



1 0274.01 0-DK 



in an interesting embodiment, the xylanase is derived from Saccharomyces carlsber- 
gensis, Saccharomyces cerevislae, Saccharomyces diastaticus, Saccharomyces douglasii, 
Saccharomyces kluyveri, Saccharomyces nortensis or Saccharomyces oviformis. 

In another interesting embodiment, the xylanase is derived from Aspergillus aculeatus^ 
5 Aspergillus awamori, Aspergillus foetidus, Aspergillus japonicus, Aspergillus r^idulans, Asper- 
gillus niger, Aspergillus oryzae, Fusarium bactridioides, Fusan'um cerealis, Fusarium crookwel- 
lense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium hetero- 
sporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, 
Fusarium sambucirtum, Fusarium sarcochroum, Fusarium sporotrichtoides, Fusarium sul- 
10 phureum, Fusarium torulosum, Fusarium tricht^hecioides, Fusarium venenatum, Humicola in- 
solerts, Humicola lanuginosa, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, 
Penicillium purpurogenum, Trichoderma harzianum, Trichoderma koningii, Trichoderma longi- 
brachiatum, Trichoderma reesei, or Trichoderma viride. 

It will be understood that for the aforementioned species, the invention encompasses 
15 both the perfect and imperfect states, and other taxonomic equivalents, e.g., anamorphs. re- 
gardless of the species name by which they are known. Those skilled in the art will readily rec- 
ognize the identity of appropriate equivalents. 

Strains of these species are readily accessible to the public in a number of culture col- 
lections, such as the American Type Culture Collection (ATCC). Deutsche Sammlung von Mik- 
20 roorganlsmen und Zellkulturen GmbH (DSM). Centraalbureau Voor Schimmelcultures (CBS), 
and Agricultural Research Service Patent Culture Collection. Northern Regional Research 
Center (NRRL). 

Furthermore, such xylanases may be identified and obtained from other sources includ- 
ing microorganisms isolated from nature (e.g., soil, composts, water, etc.) using the above- 

25 mentioned probes. Techniques for isolating microorganisms from natural habitats are well 
known in the art. The nucleotide sequence may then be derived by similariy screening a ge- 
nomic or cDNA library of another microorganism. Once a nucleotide sequence encoding a xy- 
lanase has been detected with the probe(s). the sequence may be isolated or cloned by utiliz- 
ing techniques which are known to those of ordinary skill in the art (see, e.g., Sambrook et ai, 

30 1989, supra). 

Xylanases encoded by nucleotide sequences of the present invention also include 
fused xylanases or deavable fusion xylanases in which another xylanase is fused at the N- 
terminus or the C-terminus of the xylanase or fragment thereof. A fused xylanase is produced 
by fusing a nucleotide sequence (or a portion thereoO encoding another xylanase to a nucleo- 
35 tide sequence (or a portion thereof) of the present invention. Techniques for producing fusion 
xylanases are known in the art. and include ligating the coding sequences encoding the xy- 
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lanases so that they are in frame and that expression of the fused xylanase Is under control of 
the same pronioter(s) and terminator. 

Nucleotide sequences 

The present invention also relates to polynucleotides comprising a nucleotide se- 
5 quence, which encodes a GH-61 xylanase^ In particular, the present invention relates to 
polynucleotides consisting of a nucleotide sequence, which encodes a xylanase of the inven- 
tion. In a particular emt>odiment, the nucleotide sequence is set forth in SEQ ID NO:1. SEQ ID 
NO:3 or SEQ ID NO:5. In a more particular emtxxilment. the nucleotide sequence is the ma- 
ture xylanase coding region of SEQ ID N0:1. SEQ ID NO:3 or SEQ ID NO:5. 

10 The present invention also encompasses polynucleotides comprising, preferably com- 

prising or consisting of. nucleotide sequences which encode a xylanase consisting of the 
amino acid sequence of SEQ ID NO:2. SEQ ID N0:4 or SEQ ID NO:6. or the mature xylanase 
thereof including nucleotide sequences differing from SEQ ID NO:1, SEQ ID NO:3 or SEQ ID 
NO:5 by virtue of the degeneracy of the genetic oode. 

15 The present invention also relates to polynucleotides comprising, preferably consisting 

of, a subsequence of: 

SEQ ID NO:1 which encode fragments of SEQ ID N0:2 that exhibit xylanase activity 
and contains the conserved portions. 

SEQ ID NO:3 which encode fragments of SEQ ID N0:4 that exhibit xylanase activity 
20 and contains the conserved portions. 

SEQ ID NO:5 which encode fragments of SEQ ID NO:6 that exhibit xylanase activity 
and contains the conserved portions. 

A subsequence of SEQ ID NO:1, SEQ ID N0:3 or SEQ ID NO:5 is a nucleotide se- 
25 quence encompassed by SEQ ID NO:1. SEQ ID NO:3 or SEQ ID NO:5 except that one or 
more nucleotides from the 5' and/or 3* end have been deleted. 

The present invention also relates to polynucleotides comprising, preferably consisting 
of, a modified nucleotide sequence which comprises at least one modification/mutation in the 
mature xylanase coding sequence of SEQ ID NO:1, SEQ ID NO:3 or SEQ ID NO:5, and where 
30 the modified nucleotide sequence encodes a xylanase comprising a H at position 1 , A or P at 
position 59, G at position 60, G at position 75, P or A at position 76, W or F at position 100. F 
or T at position 101, K or C at position 102, I or V or L at position 103, L or 1 or V or M at 
position 130. P at position 131, G and Xaa and Y at position 137-139. L or V or I or M at 
position 140, L or V or 1 or M at position 141, R at position 142, E or Q at positions 143-144, L 
35 or V or I at position 148, H or N at position 149. C at position 163 and P and G and P at 
position 209-211. 
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The techniques used to isolate and/or done a nucleotide sequence encoding an xy- 
ianase are known in the art and include isolation from genomic DNA. preparation from cDNA, 
or a combination thereof. The cloning of the nucleotide sequences of the present invention 
5 from such genomic DNA can be effected, e.g., by using the well known polymerase chain reac- 
tion (PCR) or antibody screening of expression libraries to detect cloned DNA fragments with 
shared structural features. See, e.g., Innis et al., 1990, PCR: A Guide to Methods and Applica- 
tion, Academic Press, New York. Other amplification procedures such as ligase chain reaction 
(LCR). ligated activated transcription (LAT) and nucleotide sequence-based amplification 

10 (NASBA) nuiy be used. The nucleotkje sequence may be cloned from a strain of Humicola or 
Coprinus, or another or related organism and thus, for example, may be an allelic or species 
variant of the xylanase encoding region of the nucleotide sequence. 

The nucleotide sequence may be obtained by standard cloning procedures used in ge- 
netic engineering to relocate the nucleotide sequence from its natural location to a different site 

15 where it will be reproduced. The cloning procedures may involve excision and isolation of a 
desired fragment comprising the nucleotide sequence encoding the xylanase, insertion of the 
fragment Into a vector molecule, and incorporation of the recombinant vector into a host cell 
where multiple copies or clones of the nucleotide sequence will be replicated. The nucleotide 
sequence may be of genomic, cDNA, RNA, semisynthetic, synthetic origin, or any combina- 

20 tions thereof. 

The present invention also relates to a polynucleotide having, preferably consisting of, 
a nucleotide sequence which has at least 65% identity with nucleotides 52 to 699 of SEQ ID 
NO:1, 46 to 957 of SEQ ID NO:3 or 58 to 660 of SEQ ID N0:5. Particulariy, the nucleotide se- 
quence has at least 70% identity, e.g. at least 80% identity, such as at least 90% identity, more 

25 preferably at least 95% Identity, such as at least 96% klentity. e.g. at least 97% klentity, even 
more preferably at least 98% identity, such as at least 99% with nucleotides 52 to 699 of SEQ 
ID NO:1. 46 to 957 off SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5. Preferably, the nucleotide 
sequence encodes a xylanase. The degree of identity between two nucleotide sequences is 
determined as described previously (see the section entitled "Definitions"). Particulariy, the nu- 

30 deotMe sequence comprises nucleotides 52 to 699 of SEQ ID NO:1, 46 to 957 of SEQ ID 
N0:3 or 58 to 660 of SEQ ID NO:5. In an even more particular embodiment, the nucleotide se- 
quence consists of nucleotkies 52 to 699 of SEQ ID NO:1, 46 to 957 of SEQ ID NO:3 or 58 to 
660 of SEQ ID NO:5. 

Modification of a nucleotide sequence encoding an xylanase of the present invention 
35 may be necessary for the synthesis of an xylanase. which comprises an amino acid sequence 
that has at least one substitution, deletion and/or insertion as compared to amino acids 18-233 
of SEQ ID NO:2. 16-319 of SEQ ID NO:4 or 20-220 of SEQ ID NO:6. These artificial variants 
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may differ in properties or other way from the xylanase isolated from its native source, e.g., 
variants may differ in cartxohydrate activation capabilities, carbohydrate specificity, themiosta- 
bnity, pH optimum or the like. 

It will be apparent to those skilled in the art that such modifications can be made to 
5 preserve the membership of the xylanase to the GH-61 homology group and can be made out- 
side regions critical to the function of the molecule regions and still result In a xylanase. Amino 
acid residues essential to the function or the GH-61 characteristics of the xylanase encoded by 
the nucleotide sequence of the invention are therefore preferably not subject to modification, 
such as substitution. Amino add residues essential to the function may be identified according 

10 to procedures known in the art, such as site-directed mutagenesis or alanine-scanning 
mutagenesis (see, e.g., Cunningham and Wells, 1989, Sdence 244: 1081-1085). In the latter 
technique, mutations are introduced at every positively charged residue in the molecule, and 
the resultant mutant molecules are tested for the carbohydrate activation function to identify 
amino acid residues that are critical to the function of the molecule. Sites of carbohydrate- 

1 5 xylanase interaction can also be determined by analysis of the three-dimensional structure as 
determined by s uch techniques a s n udear magnetic resonance a natysis, crystallography or 
photoaffinity labelling (see, e.g., de Vos et at., 1992, Science 255: 306-312; Smith et al, 1992, 
Journal of Molecular Biology 224: 899-904; Wlodaver ef a/.. 1992, FEBS Letters 309: 59-64). 

Moreover, a nucleotide sequence encoding a xylanase of the present invention may be 

20 modified by introduction of nucleotide substitutions which do not give rise to another amino 
acid sequence of the xylanase encoded by the nucleotide sequence, but which correspond to 
the codon usage of the host organism intended for production of the xylanase. 

The introduction of a mutation into the nucleotide sequence to exchange one nudeo- 
tide for another nudeotlde may be accomplished by site-directed mutagenesis using any of the 

25 methods known in the art. Particularly useful is the procedure, which utilizes a supercoiled, 
double stranded DNA vector with an insert of interest and two synthetic primers containing the 
desired mutation. The oligonucleotide primers, each complementary to opposite strands of the 
vector, extend during temperature cycling by means of Pfu DNA polymerase. On incorporation 
of the primers, a mutated plasmid containing staggered nicks is generated. Following tem- 

30 perature cycling, the product is treated with Dpn\ which is specific for methylated and hemi- 
methylated DNA to digest the parental DNA template and to select for mutation-containing syn- 
thesized DNA. Other procedures known in the art may also be used. For a general description 
of nucleotide substitution, see, e.g. Ford et al., 1 991 , Protein Expression and Purification 2: 95- 
107. 

35 The present invention also relates to a polynucleotide having, preferably consisting of. 

a nucleotide sequence which encodes a GH-61 xylanase and which hybridizes under very low 
stringency conditions, preferably under low stringency conditions, more preferably under me- 
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dium stringency conditions, more preferably under medium-high stringency conditions, even 
more preferably under high stringency conditions, and most preferably under very high strin- 
gency conditions with a polynucleotide probe selected from the group consisting of: 

5 (I) the complementary strand of nucleotides 52 to 699 of SEQ ID N0:1 , 46 to 957 of SEQ 
ID NO:3 or 58 to 660 of SEQ ID NO:5. 

(ii) the complementary strand of the cDNA sequence contained in nucleotides 52 to 699 of 
SEQ ID NO:1. 46 to 957 of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5 

(iii) the complementary strand of nucleotides 46 to 857 of SEQ ID NO:3, 

10 (iv) the complementary strand of nucleotides 52 to 300 of SEQ ID N0:1, 46 to 501 of SEQ 
ID NO:3 or 58 to 300 of SEQ ID NO:5. 
(v) the complementary strand of nucleotides 301 to 699 of SEQ ID NO:1, 502 to 957 of 
SEQ ID NO:3 or 301 to 660 of SEQ ID NO:5, 

15 (J. Sambrook. E.F. Fritsch. and T. Maniatus, 1989. Molecular Cloning, A Laboratory Manual, 
2d edition. Cold Spring Harbor, New York). 

As will be understood, details and particulars concerning hybridization of the nucleotide 
sequences will be the same or analogous to the hybridization aspects discussed In the section 
20 titled "GH-61 xylanases" herein. 

The present invention also encompass a storage medium, such as a computer disk, 
containing in digital form or in any other form the sequences as mentioned herein. 

Nucleic acid construct comprising nucleotide sequences 

The present invention also relates to nucleic acid constructs comprising a nucleotide 
25 sequence of the invention operably linked to one or more control sequences that direct the ex- 
pression of the coding sequence in a suitable host cell under conditions compatible with the 
control sequences. 

A nucleotide sequence encoding a xylanase of the invention may be manipulated in a 
variety of ways to provide for expression of the xylanase. Manipulation of the nucleotide se- 
30 quence prior to its insertion into a vector may t>e desirable or necessary depending on the ex- 
pressk>n vector. The techniques for modifying nucleotide sequences utilizing recombinant 
DNA methods are well known in the art. 

The control sequence may be an appropriate promoter sequence, a nucleotide se- 
quence which is recognized by a host cell for expression of the nucleotide sequence. The 
35 promoter sequence contains transcriptional control sequences, which mediate the expression 
of the polypeptide. The promoter may be any nucleotide sequence which shows transcrip- 
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tional activity in the host cell of choice including mutant, truncated, and hybrid promoters, and 
may be obtained from genes encoding extracellular or Intracellular polypeptides either ho- 
mologous or heterologous to the host cell. 

Examples of suitable promoters for directing the transcription of the nudeic acid con- 
5 structs of the present invention, especially in a bacterial host cell, are the promoters obtained 
from the E. coli lac operon, Streptomyces coelicofor agarase gene (dagA), Bacillus subtilis le- 
vansucrase gene (sacB). Bacillus iicheniformis alpha-amylase gene (amyL), Bacillus 
stearothermophilus maltogenic amylase gene {amyM), Bacillus amyloliquefyciens alpha- 
amylase gene (a/nyO). Bacillus Iicheniformis penicillinase gene {penP), Bacilius subWis xylA 

10 and xylB genes, and prokaryotic beta-lactamase gene (Villa-Kamaroff at al., 1978, Proceed- 
ings of the National Academy of Sciences USA 75: 3727-3731), as well as the tac promoter 
{DeBoer et al. , 1 983. Proceedings of the National Academy of Sciences USA 80: 2 1 -25). Fur- 
ther promoters are described in "Useful proteins from recombinant bacteria*' in Scientific 
American, 1980. 242: 74-94; and in Sambrool^ et al.. 1989, supra. 

15 Examples of suitable promoters for directing the transcription of the nucleic acid con- 

structs of the present invention in a filamentous fungal host cell are promoters obtained from 
the genes for Aspergillus oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase, As- 
pergillus niger neutral alpha-amylase. Aspergillus niger add stable alpha-amylase, Aspergillus 
niger or Aspergillus awamori glucoamylase (glaA), Rhizomucor miehei lipase, Aspergillus 

20 oryzae alkaline protease, Aspergillus oryzae those phosphate isomerase, Aspergillus nidutans 
acetamidase, and Fusarium oxysporum trypsin-like protease (WO 96/00787), as well as the 
NA2-tpi promoter (a hybrid of the promoters from the genes for Aspergillus niger neutral alpha- 
amylase and Aspergillus oryzae triose phosphate Isomerase), and mutant, truncated, and hy- 
brid promoters thereof. 

25 In a yeast host, useful promoters are obtained from the genes for Saccharomyces cer- 

evisiae enolase (ENO-1). Saccharomyces cerevisiae galactokinase (GAL1), Saccharomyces 
cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase (ADH2/GAP). 
and Saccharomyces cerevisiae 3-phosphoglycerate kinase. Other useful promoters for yeast 
host cells are described by Romanes etal., 1992, Yeast 8: 423-488. 

30 The control sequence may also be a suitable transcription terminator sequence, a se- 

quence recognized by a host cell to terminate transcription. The terminator sequence is oper- 
ably linked to the 3* terminus of the nudeotide sequence encoding the xylanase. Any termina- 
tor which is functional in the host cell of choice may be used in the present invention. 

Preferred terminators for filamentous fungal host cells are obtained from the genes for 

35 Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidulans an- 
thranilate synthase, Aspergillus n^ger alpha-glucosidase, and Fusarium oxysporum trypsin-like 
protease. 
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Preferred terminators for yeast host cells are obtained from the genes for Saccharomy- 
ces cerevisiae enolase, Saccharomyces cerevi^ae cytochrome C (CYC1 ). and Saccharomy- 
ces cerevisiae glyceraldehyde*3-phosphate dehydrogenase. Other useful terminators for 
yeast host ceils are described by Romanos et ai, 1992, supra, 
5 The control sequence may also be a suitable leader sequence, a non-translated region 

of an mRNA which is important for translation by the host cell. The leader sequence is opera- 
bly linked to the 5* terminus of the nucleotide sequence encoding the polypeptide. Any leader 
sequence that is functional in the host ceil of choice may be used in the present invention. 

Prefen-ed leaders for filamentous fungal host cells are obtained from the genes for As- 
10 pergilius oryzae TAKA amylase and Aspergillus nidulans triose phosphate isomerase. 

Suitable leaders for yeast host cells are obtained from the genes for Saccharomyt^s 
cerevisiae enolase (ENO-1), Saccharomyces cerevisiae 3-phosphoglycerate kinase. Sac- 
charomyces cerevisiae alpha-factor, and Saccharomyces cerevisiae alcohol dehydro- 
genase/glyceraldehyde-3-phosphate dehydrogenase (ADH2/GAP). 
15 The control sequence may also be a polyadenylation sequence, a sequence operably 

linked to the 3' temninus of the nucleotide sequence and which, when transcribed, is recog- 
nized by the host cell as a signal to add poiyadenosine residues to transcribed mRNA. Any 
polyadenylation sequence which is functional in the host cell of choice may be used in the pre- 
sent invention. 

20 Prefen^ed polyadenylation sequences for filamentous fungal host cells are obtained 

from the genes for Aspergillus oryzae TAKA amylase, Aspergillus n/ger glucoamylase, Asper- 
gillus niduians anthranilate synthase. Fusarium oxysporum trypsln-like protease, and Aspergil- 
lus n/ger alpha-glucosidase. 

Useful polyadenylatton sequences for yeast host cells are described by Quo and 

25 Shemnan. 1 995, Molecular Cellular Biology 1 5: 5983-5990. 

The control sequence may also be a signal peptide coding region that codes for an 
amino acid sequence linked to the amino terminus of a polypeptide and directs the encoded 
xylanase into the cell's secretory pathway. The 5' end of the coding sequence of the nucleo- 
tide sequence may inherently contain a signal peptide coding region naturally linked In transla- 

30 tion reading frame with the segment of the coding region which encodes the secreted xy- 
lanase. Alternatively, the 5' end of the coding sequence may contain a signal peptide coding 
region which is foreign to the coding sequence. The foreign signal peptide coding region may 
be required where the coding sequence does not naturally contain a signal peptide coding re- 
gion. Alternatively, the foreign signal peptide coding region may simply replace the natural 

35 signal peptide coding region in order to enhance secretion of the xylanase. However, any sig- 
nal peptide coding region which directs the expressed xylanase into the secretory pathway of a 
host cell of choice may be used in the present invention. 
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The signal peptide coding region is nucleotides 1 to 51 of SEQ ID N0:1, 1 to 51 of SEO 
ID NO:3 or 1 to 57 of SEQ ID NO:5 which encode amino adds 18 to 233 of SEQ ID NO:2, 16 
to 319 of SEQ ID NO:4 or 20 to 220 of SEQ ID NO:6. 

Effective signal peptide coding regions for bacterial host cells are the signal peptide 
5 coding regions obtained from the genes for Bacillus NCIB 11837 maltogenic amylase. Bacillus 
stearothermophilus alpha-amylase. Bacillus Jicheniformis subtilisin, Bacillus licheniforwis beta- 
lactamase. Bacillus stearothermophilus neutral proteases (nprT, nprS, nprM), and Bacillus 
subtilis prsA. Further signal peptides are described by Sinrionen and Palva, 1993, Microbi- 
ological Reviews 57: 109-137. 

10 Effective signal peptide coding regions for filamentous fungal host cells are the signal 

peptide coding regions obtained from the genes for Aspergillus oryzae TAKA amylase, Asper- 
gillus niger neutral amylase, Aspergillus niger glucoamylase, Rhizomucor miehel aspartic pro- 
teinase, Humicola insoiens cellulase, and Humicola lanuginosa lipase. 

Useful signal peptides for yeast host cells are obtained from the genes for Saccharo- 

15 myces cerevisiae alpha-factor and Saccharomyces cerevisiae invertase. Other useful signal 
peptide coding regions are described by Romanes etaL, 1992, supra. 

The control sequence may also be a propeptide coding region that codes for an amino 
acid sequence positioned at the amino terminus of a xylanase. The resultant polypeptide may 
be denoted a pro-xylanase or propolypeptide. A propolypeptide is generally inactive and can 

20 be converted to a mature active polypeptide by catalytic or autocatalytic cleavage of the 
propeptide from the propolypeptide. The propeptide coding region may be obtained from the 
genes for Bacillus subtilis alkaline protease (aprE), Bacillus subtilis neutral protease {nprT), 
Saccharomyces cerevisiae alpha-factor, Rhizomucor miehel aspartic proteinase, and My- 
celiophthora thermophila laccase (WO 95/33836). 

25 Where both signal peptide and propeptide regions are present at the amino terminus of 

a polypeptide, the propeptide region is positioned next to the amino terminus of a polypeptide 
and the signal peptide region is positioned next to the amino terminus of the propeptide region. 

In yeast, the AD1H2 system or GAL1 system may be used. In filamentous fungi, the 
TAKA alpha-amylase promoter, Aspergillus niger glucoamylase promoter, and Aspergillus 

30 oryzae glucoamylase promoter may be used as regulatory sequences. Other examples of 
regulatory sequences are those which allow for gene amplification. Ineukaryotic systems, 
these include the dihydrofolate reductase gene which Is amplified in the presence of meth- 
otrexate, and the metallothionein genes which are amplified with heavy metals. In these 
cases, the nucleotide sequence encoding the polypeptide would be operabty linked with the 

35 regulatory sequence. 
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Recombinant expression vector comprising nucleic acid construct 

The present invention also relates to recombinant expression vectors comprising the 
nudeic acid construct of the invention. The various nucleotide and control sequences de- 
scribed above may be Joined together to produce a recombinant expression vector, which may 
5 include one or more convenient restriction sites to allow for insertion or substitution of the nu- 
cleotide sequence encoding the polypeptide at such sites. Alternatively, the nucleotide se- 
quence of the present invention may be expressed by inserting the nucleotide sequence or a 
nucleic acid construct comprising the sequence into an appropriate vector for expression, in 
creating the expression vector, the coding sequence is located in the vector so that the coding 
10 sequence is operably linked with the appropriate control sequences for expression. 

The recombinant expression vector may be any vector (e.g., a plasmid or virus) which 
can be conveniently subjected to recombinant DNA procedures and can bring about the ex- 
pression of the nucleotide sequence. The choice of the vector will typically depend on the 
compatibility of the vector with the host ceil into which the vector is to be introduced. The vec- 
15 tors may be linear or closed circular piasmlds. 

The vector may t>e an autonomously replicating vector, i\e. a vector which exists as an 
extrachromosomal entity, the replication of which is independent of chromosomal replication, 
e.g. a plasmid, an extrachromosomal element, a minichromosome, or an artificial chromo- 
some. 

20 The vector may contain any means for assuring self-replication. Alternatively, the vec- 

tor may be one which, when introduced into the host cell, is Integrated into the genome and 
replicated together with the chromosome(s) into which it has been integrated. Furthermore, a 
single vector or plasmid or two or more vectors or plasmids which together contain the total 
DNA to be introduced into the genome of the host cell, or a transposon may be used. 

25 The vectors of the present invention preferably contain one or more selectable markers 

which permit easy selection of transformed cells. A selectable marker is a gene the product of 
which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to 
auxotrophs, and the like. 

Examples o f b acterial s electable m arkers a re the da/ g enes from 6 acilius s ubtilis o r 

30 Bacillus licheniformis, or markers which confer antibiotic resistance such as ampicillin, kana- 
mycin, chloramphenicol or tetracycline resistance. Suitable markers for yeast host cells are 
ADE2, HIS3. LEU2, LYS2, MET3, TRP1, and URA3. Selectable markers for use in a filamen- 
tous fungal host cell include, but are not limited to, amdS (acetamidase), argB (ornithine car- 
bamoyltransferase), bar (phosphinolhricin acetyltransferase). hygS (hygromycin phosphotrans- 

35 ferase), n/aD (nitrate reductase). pyrG (orotidine-5'-phosphate decarboxylase), sC (sulfate 
adenyitransferase), trpC (anthranilate synthase), as well as equivalents thereof. 



24 



10274.010-DK 

Preferred for use in an Aspergillus cell are the amdS and pyrG genes of Aspergillus 
nidulans or Aspergillus oryzae and the bar gene of Streptomyces hygrascoplcus. 

The vectors of the present invention preferably contain an element(s) that permits sta- 
ble integration of the vector Into the host cell's genome or autonomous replication of the vector 
5 in the cell independent of the genome. 

For integration into the host cell genome, the vector may rely on the nucleotide se- 
quence encoding the polypeptide or any other element of the vector for stable integration of 
the vector into the genome by homologous or nonhomologous recombination. Alternatively, 
the vector nnay contain additional nucleotide sequences for directing integration by homolo- 
10 gous recombination into the genome of the host cell. The additional nucleotide sequences en- 
able the vector to be integrated Into the host cell genome at a precise location(s) in the chro- 
mosome(s). To increase the liJcelihood of integration at a precise location, the integrational 
elements should preferably contain a sufficient number of nucleotides, such as 100 to 1.500 
base pairs, preferably 400 to 1,500 base pairs, and most preferably 800 to 1.500 base pairs. 
15 which are highly homologous with the corresponding target sequence to enhance the probabil- 
ity of homologous recombination. The integrational elements may be any sequence that is 
homologous with the target sequence in the genome of the host cell. Furthemiore, the integra- 
tional elements may be non-encoding or encoding nucleotide sequences. On the other hand, 
the vector may be integrated into the genome of the host cell by non-homofogous recombina- 
20 tion. 

For autonomous replication, the vector may further comprise an origin of replication 
enabling the vector to replicate autonomously in the host ceil in question. Examples of bacte- 
rial origins of replication are the origins of replication of plasmids pBR322, pUC19, pACYC177. 
and PACYC184 permitting repllcaUon in E. co//, and pUBHO. pE194, pTA1060. and pAMBI 

25 permitting replication in Bacillus. Examples of origins of replication for use in a yeast host cell 
are the 2 micron origin of replication. ARS1, ARS4, the combination of ARS1 and CEN3, and 
the combination of ARS4 and CEN6. The origin of replication may be one having a mutation 
which makes its functioning temperature-sensitive in the host cell (see, e.g., Ehriich, 1978, 
Proceedings of the National Academy of Sciences USA 75: 1433). 

30 More than one copy of a nucleotide sequence of the present invention may be inserted 

into the host cell to increase production of the gene product. An increase in the copy number 
of the nucleotide sequence can be obtained by integrating at least one additional copy of the 
sequence into the host cell genome or by including an amplifiable selectable mariner gene with 
the nucleotide sequence where cells containing amplified copies of the selectable mariner 

35 gene, and thereby additional copies of the nucleotide sequence, can be selected for by culti- 
vating the cells in the presence of the appropriate selectable agent. 
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The procedures used to ligate the elements described above to construct the recombi- 
nant expression vectors of the present invention are well known to one skilled in the art (see 
e.g. Sambrook etaL, 1989, supra). 

Recombinant host cell comprising nucleic acid construct 

6 The present Invention also relates to recombinant a host cell comprising the nucleic 

acid construct of the invention, which are advantageously used in the recombinant production 
of the polypeptides. A vector comprising a nucleotkje sequence of the present invention Is 
introduced into a host cell so that the vector is maintained as a chromosomal integrant or as a 
self-replicating extra-chromosomal vector as described eariier. 

10 The host cell may be a unicellular microorganism, e.g., a prokaryote or a non- 

unicellular microorganism, e.g.. a eukaryote. 

Useful unicellular cells are bacterial cells such as gram positive bacteria including, but 
not limited to, a Baciilus cell, e.g.. Bacillus afkafophilus. Bacillus amy!oliquefaciens. Bacillus 
brevis. Bacillus circulans, Badllus dausii. Bacillus coagulans. Bacillus lautus. Bacillus lentus, 

15 BaaVus licheniformis. Bacillus megaterium, Bacilius stearothermophilus. Bacillus subWis, and 
Bacillus thuringiensis] or a Straptomyces cell, e.g., Streptomyces IMdans or Straptomyces 
muhnus, or gram negative bacteria such as £. coli and Pseudomonas sp. In a preferred em- 
bodiment, the bacterial host cell is a Bacillus lentus. Bacillus licheniformis, Bacillus 
stearothermophilus, or Bacillus subtilis cell. In another preferred embodiment, the Bacillus cell 

20 is an alkalophilic Bacillus. (KKSC/SALK verify relevans for GH-61 xylanases) 

The introduction of a vector into a bacterial host cell may, for instance, t>e effected by 
protoplast transfonmation (see. e.g., Chang and Cohen, 1979, Molecular General Genetics 
168: 111-115), using competent cells (see, e.g., Young and Spizizin, 1961, Journal of Bacteri- 
ology 81: 823-829, or Dubnau and Davidoff-Abelson, 1971. Journal of Molecular Biology 56: 

25 209-221), electroporation (see, e.g., Shigekawa and Dower. 1988, Biotechniques 6: 742-751), 
or conjugation (see, e.g.. Koehler and Thorne, 1987. Journal of Bacteriology 169: 5771-5278). 
The host cell may be a eukaryote, such as a mammalian. Insect, plant, or fungal cell. 
In a preferred embodiment, the host cell is a fungal cell. '*Fungi* as used herein in- 
cludes the phyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota (as defined 

30 by Hawksworth e^ a/.. In, Ainsworth and Bisby's Dictionary of The Fungi, 8th edition, 1995, 
CAB Intemational, University Press. Camt>ridge, UK) as well as the Oomycota (as cited In 
Hawksworth et al., 1995. supra, page 171) and all mitosporic fungi (Hawksworth et aL, 1995. 
supra). In a more preferred embodiment, the fungal host cell is a yeast cell. "Yeast" as used 
herein includes ascosporogenous yeast (Endomycetales), basidiosporogenous yeast, and 

35 yeast belonging to the Fungi Imperfectl (Blastomycetes). Since the classification of yeast may 
change in the future, for the purposes of this invention, yeast shall be defined as described in 
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Bioiogy and Activities Of Yeast (Skinner, F.A., Passmore. S.M., and Davenport. R.R., eds. Soa 
App, Bacterioi, Symposium Series No. 9, 1980). 

In an even more preferred embodiment, the yeast host cell Is a Candida, Hansenuia, 
Ktuyveromyces, Pichia, Saccharomyces, Schizosaccharomyces. or Yanowia celt. 
5 In a most preferred embodiment, the yeast host cell is a Saccharomyces carisbergen- 

sis, Saccharomyces cerevislae, Saccharomyces diastaticus, Saccharomyces dougiasii, Sac- 
charomyces kluyveri, Saccharomyces norbensis or Saccharomyces oviformis ceil. In another 
most preferred embodiment, the yeast host cell is a Kiuyveromyces iactis cell, in another most 
preferred embodiment, the yeast host cell Is a Yanvwia iipoiytica cell. 

10 In another more preferred embodiment, the fungal host cell is a filamentous fungal celi. 

"Filamentous fungi" include all filamentous forms of the subdivision Eumycota and Oomycota 
(as defined by Hawksworth et al, 1995, supra). The filamentous fungi are characterized by a 
mycelial wall composed of chitin. cellulose, glucan. chitosan, mannan, and other complex 
polysaccharides. Vegetative growth is by hyphal elongation and carbon catabolism is obliga- 

15 tely aerobic. In contrast, vegetative growth by yeasts such as Saccharomyces cerevisiae is by 
budding of a unicellular thallus and carbon catabolism may be fermentative. 

In an even more preferred embodiment, the filamentous fungal host cell is a cell of a 
species of. but not limited to. Acremonium, Aspergiiius, Fusarium, Humicola, Mucor, My- 
celiophthora, Neurospora, Peniclilium, Thietavia, Tofypoctadium, or Trichoderma. 

20 In a most preferred embodiment, the filamentous fungal host cell is an Aspergilius 

awamori, AspergiHus foetidus, Aspergilius japonicus, Aspergillus niduians, Aspergillus niger or 
Aspergillus oryzae cell. In another most preferred embodiment, the filamentous fungal host 
cell is a Fusarium bactridioides, Fusarium cerealis, Fusarium crookweilense, Fusarium cul- 
morum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium ne- 

25 gundi, Fusarium oxy^porum, Fusarium reticulatum, Fusarium roseum, Fusarium sambudnum, 
Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulo- 
sum, Fusarium trichothedoides. or Fusarium venenatum celt. In an even most preferred em- 
bodiment, the filamentous fungal parent celMs a Fusarium venenatum (Nirenberg s p. nov.) 
celt, in another most prefened embodiment, the filamentous fiingal host cell is a Humicoia in- 

30 solans, Humicoia lanuginosa, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, 
Penidllium purpurogenum, melavia terrestris, Trichoderma harzianum, Trichoderma koningii, 
Trichoderma longibrachiatum, Trichoderma reeseU or Trichoderma viride cell. 

Fungal cells may be transformed by a process involving protoplast formation, transfor- 
mation of the protoplasts, and regeneration of the cell wall in a manner known perse. Suitable 

35 procedures for transfonmation of Aspergillus host ceils are described in EP 238 023 and Yelton 
et al, 1984. Proceedings of the National Academy of Sciences USA 81: 1470-1474. Suitable 
methods for transforming Fusarium species are described by Malardier et af„ 1989. Gene 78: 
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147-156 and WO 96/00787. Yeast may be transformed using the procedures described by 
Becker and Guarente, In Abelson. J.N. and Simon, M.I.. editors. Guide to Yeast Genetics and 
Molecular Biology, Methods in Enzymology, Volume 194, pp 182-187, Academic Press, Inc., 
New Yorlc; Ito et al., 1983, Journal of Bacteriology 153: 163; and Hinnen et aL, 1978, Proceed- 
5 ings of the National Academy of Sciences USA 75: 1 920. 

Processes for preparing functional 6H-61 xylanases 

The present invention also relates to methods for producing a xylanase of the present 
invention comprising (a) cultivating a strain, which in its wild-type form is capable of producing 
the xylanase; and (b) recovering the xylanase. Preferably, the strain is a fungus, more pref- 

10 erably of the genus Humicola. particularly Humicola insolens or Coprinus, such as Coprinus 
cinereus or Thelavia such as Thelavia tenresths 

The present invention also relates to methods for producing a xylanase of the invention 
comprising (a) cultivating a host cell under conditions conducive for production of the xylanase; 
and (b) recovering the xylanase. 

15 In the production methods of the present invention, the cells are cultivated in a nutrient 

medium suitable for production of the xylanase using methods known in the art. For example, 
the cell may be cultivated by shake flask cultivation, small-scale or large-scale fermentation 
(including continuous, batch, fed-batch, or solid state fermentations) in laboratory or industrial 
fermentors performed in a suitable medium and under conditions allowing the polypeptide to 

20 be expressed and/or isolated. The cultivation takes place in a suitable nutrient medium com- 
prising carbon and nitrogen sources and inorganic salts, using procedures known in the art. 
Suitable media are available from commercial suppliers or may be prepared according to pub- 
lished compositions (e.g., in catalogues of the American Type Culture Collection). If the xy- 
lanase is secreted into the nutrient medium, the xylanase can t>e recovered directly from the 

25 medium. If the xylanase is not secreted, it can be recovered from cell lysates. 

The xylanase may be detected using methods known in the art and modifications 
thereof that are specific for the xylanase. These detection methods may include use of spe- 
cific antibodies, formation of a xylanase - carbohydrate complex, or disappearance of an acti- 
vated carbohydrate substrate. For example, a carbohydrase assay may be used to detenmine 

30 the activity of a carbohydrase on a carbohydrate with and without presence of xylanase. 

The resulting xylanase may be recovered by methods known in the art For example, 
the xylanase may be recovered from the nutrient medium by conventional procedures Includ- 
ing, but not limited to. centrifugation, filtration, extraction, spray-drying, evaporation, or precipi- 
tation. 

35 The polypeptides of the present invention may be purified by a variety of procedures 

known in the art including, but not limited to, chromatography (e.g., ion exchange, affinity, hy- 
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drophobic, chromatofocusing, and size exclusion), electrophoretic prooedures (e.g., prepara- 
tive isoelectric focusing), differential solubility (e.g. ammonium sulfate precipitation), SDS- 
PAGE. or extraction (see, e.g. Protein Purification, J.-C. Janson and Lars Ryden, editors, VCH 
Publishers, New York, 1989). 

5 Transgenic plants 

The present invention also relates to a transgenic plant, plant part, or plant cell which 
has been transformed with a nucleotide sequence encoding a xylanase of the present 
invention so as to express and produce the xylanase. In one embodiment the plant could be 
used as host for production of xylanase in recoverable quantities. The xylanase may be 
10 recovered from the plant or plant part. Alternatively, the plant or plant part containing the 
recombinant xylanase may be used as such for improving the quality of a food or feed, e.g., 
improving nutritional value, palatabllity, and rheological properties, or to destroy an antinutritive 
factor. In particular the plant or plant parts expressing the xylanase may be used as an 
improved starting material for production of fuel-aloohols or bio-ethand . 
15 The transgenic plant can be dicotyledonous (a dicot) or monocotyledonous (a 

monocot). Examples of monocot plants are grasses, such as meadow grass (blue grass, 
Poa), forage grass such as festuca, lolium. temperate grass, such as Agrostls, and cereals, 
e.g., wheat, oats. rye. bariey. rice, sorghum, and maize (com). 

Examples of dicot plants are tobacco, legumes, such as lupins, potato, sugar beet, 
20 pea, bean and soybean, and cruciferous plants (family Brassicaceae), such as cauliflower, 
rape seed, and the closely related model organism Ambidopsis tiiaiiana. 

Examples of plant parts are stem, callus, leaves, root, fruits, seeds, and tubers. Also 
specific plant tissues, such as chloroplast, apoplast, mitochondria, vacuole, peroxisomes, and 
cytoplasm are considered to be a plant part. Furthermore, any plant cell, whatever the tissue 
25 origin, is considered to be a plant part. 

Also included within the scope of the present invention are the progeny of such plants, 
plant parts and plant cells. 

The transgenic plant or plant cell expressing a xylanase of the present Invention may 
be constructed in accordance witii methods known In the art. Briefly, the plant or plant cell is 
30 constructed by incorporating one or more expression constructs encoding a xylanase of the 
present invention into the plant host genome and propagating the resulting modified plant or 
plant cell Into a transgenic plant or plant cell. 

Convenientiy. the expression construct is a nucleic acid construct which comprises a 
nucleotide sequence encoding a xylanase of the present invention operably linked with 
35 appropriate regulatory sequences required for expression of the nucleotide sequence in the 
plant or plant part of choice. Furthenmore, the expression construct may comprise a selectable 
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marker useful for identifying host cells into which the expression construct has been integrated 
and DNA sequences necessary for introduction of the construct into the plant in question (the 
latter depends on the DNA introduction method to be used). 

The choice of regulatory sequences, such as promoter and terminator sequences and 
optionally signal or transit sequences, is detemriinecf, for example, on the basis of when, 
where, and how the xylanase is desired to be expressed. For instance, the expression of the 
gene encoding a xylanase of the present invention may be constitutive or inducible, or may be 
developmental, stage or tissue specific, and the gene product may be targeted to a specific 
tissue or plant part such as seeds or leaves. Regulatory sequences are, for example, 
described by Tague et ai, 1988, Plant Physiology 66: 506. 

For constitutive expression, the 35S-CaMV promoter may t>e used (Franck et aL, 
1980, Cell 21: 285-294). Organ-specific promoters may be. for example, a promoter from stor- 
age sink tissues such as seeds, potato tubers, and fruits (Edwards & Ck)ruzzi. 1990, Ann. Rev, 
Genet 24: 275-303). or from metabolic sink tissues such as meristems (Ito e^ aL, 1994, Plant 
MoL Biol. 24: 863-878), a seed specific promoter such as the glutelin, prolamin. globulin, or 
albumin promoter from rice (Wu et aL, 1998, Plant and Cell Physiology 39: 885-889), a Vicia 
faba promoter from the legumin B4 and the unknown seed protein gene from Vicia faba (Con- 
rad e^ a/., 1998, Journal of Plant Physiology 152: 708-711). a promoter firom a seed oil body 
protein (Chen et ai, 1998, Plant and Cell Physiology 39: 935-941), the storage protein napA 
promoter from Brassica napus, or any other seed specific promoter known in the art, e.g.. as 
described in WO 91/14772. Furthermore, the promoter may be a leaf specific promoter such 
as the rbcs promoter from rice or tomato (Kyozuka et aL, 1993, Plant Physiology 102: 991- 
1000, the chlorella virus adenine methyltransferase gene promoter (Mitra and Higgins, 1994, 
Plant Molecular Biology 26: 85-93), or the aldP gene promoter from rice (Kagaya et aL, 1995, 
Molecular and General Genetics 248: 668-674). or a wound inducible promoter such as the 
potato pin2 promoter (Xu et aL, 1993, Plant Molecular Biology 22: 573-588). 

A promoter enhancer element may also be used to achieve higher expression of the 
xylanase in the plant. For instance, the promoter enhancer element may be an intron which is 
placed between the promoter and the nucleotide sequence encoding an xylanase of the 
present invention. For instance. Xu et aL, 1993, supra disclose the use of the first intron of the 
rice actin 1 gene to enhance expression. 

The selectable maricer gene and any other parts of the expression construct may be 
chosen from those available in the art. 

The nucleic acid construct is incorporated into the plant genome according to 
conventional techniques known in the art, including ^grobacfeni/m-mediated transformation, 
virus-mediated transformation, m icroinjection, particle bombardment, biolistic transformation, 
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and electroporalion (Gasser et al., 1990. Science 244: 1293; Potrykus, 1990, Biotechnology 
8: 535; Shimamoto ef a/.. 1989. Nature 338: 274). 

Presently. Agrobacterium tu/ne/ac/ens-mediated gene transfer is the method of choice 
for generating transgenic dicots (for a review, see Hooykas and Schilperoort. 1992, Piant 
Molecular Biology 19: 15-38). However it can also be used for transfonming monocots. 
although other transformation methods are generally preferred for these plants. Presently, the 
method of choice for generating transgenic monocots is particle bombardment (microscopic 
gold or tungsten particles coated with the transforming DNA) of embryonic calli or developing 
embryos (Christou. 1992, Plant Journal 2: 275-281; Shimamoto, 1994, Current Opinion 
Biotechnology 5: 158-162; VasrI et aL 1992. Bio/Technology 10: 667-674). An alternative 
method for transformation of monocots is based on protoplast transformation as described by 
Omirulleh et ai , 1 993. Plant tAolecular Biology 21:41 5-428. 

Following transformation, the transformants having incorporated therein the 
expression construct are selected and regenerated into whole plants according to m ethods 
well-known in the art. 

The present invention also relates to methods for producing an xylanase of the present 
invention comprising (a) cultivating a transgenic plant or a plant cell comprising a nucleotide 
sequence encoding an xylanase of the present invention under conditions conducive for 
production of the xylanase; and (b) recovering the xylanase. 

Compositions comprising functional GH-61 xyJanases 

In a still further aspect, the present invention relates to compositions comprising a xy- 
lanase of the present invention and their preparation, in particular compositions wherein the 
xylanase of the invention is the wb\ot component of the composition, e.g.. a mono-component 
composition. 

The c omposition m ay further c omprise o ne o r m ore e nzymes. i n p articular c arbohy- 
drases such as amylase, glucanase. xylanase, gaiactanase. mannanase etc, The enzymes 
may also include enzymes such as aminopeptidase. carboxypeptidase. catalase. chitinase, 
cutinase, cyclodextrin glycosyltransferase, deoxyribonuclease, esterase, alpha-galactoskjase, 
beta-galactosidase, gluooamylase, alpha-glucoskJase, beta-glucosidase. haloperoxidase, in- 
vertase, laccase. lipase, mannosldase, oxidase, pectinolytic enzymes, peptidoglutaminase. 
peroxidase, phytase. polyphenoloxkjase, proteolytic enzyme, ribonuclease. transglutaminase, 
or xylanase. 

The compositions may be prepared In accordance with methods known in the art and 
may be In the form of a liquid or a solid composition. For instance, the xylanase composition 
may be formulated using methods known to the art of formulating enzymes and/or 
phamnaceutical products, e.g. into coated or uncoated granules or micro-granules. The 
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xylanase to be included in the composition may be stabilized in accordance with methods 
known in the art e.g. by stabilizing the xylanase in the composition by adding and antioxidant 
or reducing agent to limit oxidation or the xylanase of it may be stabilized by adding polymers 
such as PVP. PVA. PEG or other suitable polymers known to be beneficial to the stability of 
5 polypeptides in solid or liquid compositions 

in a further embodiment, the composition of the invention is a detergent composition 
which, in addition to the xylanase* comprises a surfactant and optionally compounds selected 
from the group consisting of builders such as zeolites, bleaching agents such as percarfoonate, 
bleach enhancers such as TAED or NOBS, suds suppressors, fragrants, etc. 
10 In a further embodiment, the composition of the invention is a feed composition which, 

in addition to the xylanase, comprises a cereal or grain product. 

In a further embodiment, the composition of the invention comprise a xylanase 
of the invention and a vegetable flour, in particular a cereal flour. In particular the composition 
is a dough or premix composition use for baking bread. 
15 In a further embodiment, the composition of the invention is femientable composition, 

which in addition to the xylanase, comprises one or more nutrients for a microorganism. 

In a further embodiment, the composition of the invention is a pulping composition, 
which in addition to the xylanase. comprises pulp. 



20 Applications of functional GH-61 xyianases 

In a still further aspect, the invention relates to processes involving the xylanase of the 
invention. The xyianases of the invention may suitably be applied In any process involving car- 
bohydrates, particulariy hydrolysis of xyiotosic substrates. 

In one embodiment, the process of the invention is a cleaning process comprising con- 
25 tacting a cellulosic fabric to be cleaned with a xylanase of the invention. Such a process is par- 
ticulariy useful when the fabric is soiled with xylolosic soilings 

In another embodiment, the process of the invention is a process for the preparation of 
food and/or feed composition comprising mixing a food/feed mixture with a xylanase of the in- 
vention. Xylanase treatment of animal feed and silage may release free pentose sugars or may 
30 reduce or modify the viscosity of xylan rich feed such as v^eaL 

In particular the process of the invention is a process for preparing a flour composition 
comprising mixing a xylanase of the invention with a vegetable flour, particulariy a cereal flour 
or a process for preparing a dough composition comprising mixing a xylanase of the invention 
with dough ingredients or a process for preparing a baked product comprising heating a dough 
35 composition of the invention. 



32 



10274.010-OK 



In yet another embodiment, the process of the invention Is a process for xylanase hy- 
drolysis of agricultural wastes for production of alcohol fuels. 

In yet another embodiment the process of the invention is a brewing process wherein 
the presence of a xylanase may improve filterability of the wort. 
5 In yet another embodiment, the process of the invention is a process for preparation of 

fruit or vegetable juices, wherein presence of a xylanase may improve filtration and increase 
yields. 

In yet another embodiment the process of the invention is a process for treatment of 
lignolosic materials and pulp with a xylanase. wherein the xylanase will dissolve xylofosic ma- 
10 terials. 

The enzyme of the invention (produced by the method of the Invention) may be pro- 
vided in the form of a granulate, preferably a non-dusting granulate, a liquid, in particular a 
stabilized liquid, a slurry, or a protected enzyme. For certain applications, immobilization of the 
15 enzyme on a solid matrix may be preferred. 

DETERGENT DISCLOSURE AND EXAMPLES 

The xylanase of the invention may be added to and thus become a component of a de- 
tergent composition. 

The detergent composition of the invention may for example be formulated as a hand 
20 or machine laundry detergent composition including a laundry additive composition suitable for 
pre-treatment of stained fabrics and a rinse added fabric softener composition, or be formu- 
lated as a detergent composition for use in general household hard surface cleaning opera- 
tions, or be formulated for hand or machine dishwashing operations. 

In a specific aspect, the invention provides a detergent additive comprising the en- 
25 zyme of the invention. The detergent additive as well as the detergent composition may com- 
prise one or more other enzymes such as a protease, a lipase, a cutinase, an amylase, a car- 
bohydrase, a cellulase, a pectinase. a mannanase. an arabinase, a galactanase. a xylanase, 
an oxidase, e.g., a laccase, and/or a peroxidase. 

In general the properties of the chosen enzyme(s) should be compatible with the se- 
30 lected detergent, (i.e. pH-optimum, compatibility with other enzymatic and non-enzymatic in- 
gredients, etc.), and the enzyme(s) should be present in effective amounts. 

Proteases: Suitable proteases include those of animal, vegetable or microbial origin. 
Microbial origin is preferred. Chemically modified or protein engineered mutants are included. 
The protease may be a serine protease or a metallo protease, preferably an alkaline microbial 
35 protease or a trypsin-like protease. Examples of alkaline proteases are subUlisins, especially 
those derived from Bacillus, e.g., subtilisin Novo, subtifisin Carisberg, subtillsin 309, subtilisin 
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147 a nd s ubtilisin 1 68 ( described i n WO 8 9/08279). E xamples o f t rypsin-like p roteases a re 
trypsin (e.g. of porcine or bovine origin) and the Fusarium protease described In WO 89/06270 
and WO 94/25583. 

Examples of useful proteases are the variants described in WO 92/19729, WO 
5 98/20115. WO 98/20116. and WO 98/34946. especially the variants with substitutions in one 
or more of the following positions: 27, 36. 57, 76. 87. 97, 101, 104, 120. 123, 167. 170, 194, 
206. 218, 222. 224, 235 and 274. 

Prefen^d commerdally available protease enzymes include Alcalase®, Savlnase®, 
Primase®, Duralase®, Esperase®. and Kannase® (Novozymes MS), Maxatase®, Maxacal®, 
10 Maxapem®, Properase®, Purafed®, Purafect OxP®, FN2®, and FN3® (Genencor Intema- 
tional Inc.). 

Lipases: Suitable lipases include those of bacterial or fungal origin. Chemically modi- 
fied or protein engineered mutants are included. Examples of useful lipases include lipases 
from Humicola (synonym Themiomyces). e.g. from H. lanuginosa (T. lanuginosus) as de- 
15 scribed in EP 258 068 and EP 305 216 or from H. Insolens as described tn WO 96/13580, a 
Pseudomonas lipase, e.g. from P. alcallgenes or P. pseudoalcaligenes (EP 218 272), P. ce- 
pacia (EP 331 376). P. stutzeri (GB 1,372,034), P. fluorescens, Pseudomonas sp. strain SO 
705 (WO 95/06720 and WO 96/27002), P. wisconsinensis (WO 96/12012). a Bacillus lipase, 
e.g. from B. subtilis (Dartois et al. (1993). Biochemica et Biophysica Acta. 1131, 253-360), B. 
20 stearothemiophilus (JP 64/744992) or B. pumilus (WO 91/16422). 

Other examples are lipase variants such as those described in WO 92/05249, WO 
94/01541, EP 407 225, EP 260 105. WO 95/35381. WO 96/00292, WO 95/30744. WO 
94/25578, WO 95/14783. WO 95/22615. WO 97/04079 and WO 97/07202. 

Preferred commercially available lipase enzymes include UpolaseTM and Lipolase Ul- 
25 traTM (Novozymes A/S). 

Amylases: Suitable amylases (alpha and/or beta) include those of bacterial or fungal 
origin. Chemically modified or protein engineered mutants are included. Amylases include, for 
example. Q-amylases obtained from Bacillus, e.g. a special strain of B. licheniformis. described 
in more detail in GB 1 .296.839. 
30 Examples of useful amylases are the variants described in WO 94/02597. WO 

94/18314. WO 96/23873, and WO 97/43424. especially the variants with substitutions in one 
or more of the following positions: 15. 23, 105, 106, 124. 128, 133. 154, 156, 181. 188, 190, 
197. 202, 208. 209, 243. 264. 304. 305. 391. 408. and 444. 

Commercially available amylases are Duramyr", Termamyl™. Fungamy^** and 
35 BAN^" (Novozymes A/S). Rapidase^'^ and Purastar^" (from Genencor International Inc.). 

Cellulases: Suitable cellulases include those of bacterial or fungal origin. Chemically 
modified or protein engineered mutants are included. Suitable cellulases include cellulases 
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from the genera Bacillus. Pseudomonas, Humicota, Fusarium, Thielavia, Acremonium. e.g. the 
fungal cellulases produced from Humicola Insolens, Myoeliophthora thermophila and Fusarium 
oxysporum disclosed in US 4.435.307, US 5,648,263. US 5,691.178. US 5,776,757 and WO 
89/09259. 

5 Especially suitable cellulases are the alkaline or neutral cellulases having colour care 

benefits. Examples of such cellulases are cellulases described In EP 0 495 257, EP 0 531 372, 
WO 96/11262, WO 96/29397. WO 98/08940. Other examples are cellulase variants such as 
those described in WO 94/07998, EP 0 531 315, US 5.457.046. US 5.686.593. US 5,763.254. 
WO 95/24471 , WO 98/12307 and PCT/DK98/00299. 
10 Commercially available cellulases include Celluzyme®, and Carezyme® (No- 

vozymes), Clazinase®, and Puradax HA® (Genencor Intemational Inc.). and KAC-500(B) ® 
(Kao Corporation). 

Peroxidases/Oxidases: Suitable peroxidases/oxidases include those of plant, bacterial 
or fungal origin. Chemically modified or protein engineered mutants are included. Examples of 

15 useful peroxidases include peroxidases from Coprinus, e.g. from C. dnereus, and variants 
thereof as those described in WO 93/24618, WO 95/10602. and WO 98/15257. 

Commercially available peroxidases include Guardzyme® (Novozymes A/S). 
The detergent enzyme(s) may be included in a detergent composition by adding sepa- 
rate additives containing one or more enzymes, or by adding a combined additive comprising 

20 all of these enzymes. A detergent additive of the invention, i.e. a separate additive or a com- 
bined additive, can be formulated e.g. as a granulate, a liquid, a slurry, etc. Preferred detergent 
additive formulations are granulates, in particular non-dusting granulates, liquids, in particular 
stabilized liquids, or slurries. 

Non-dusting granulates may be produced, e.g., as disclosed in US 4.106.991 and 

25 4,661,452 and may optionally be coated by methods known in the art. Examples of waxy coat- 
ing materials are poly (ethylene oxide) products (polyethyleneglycol. PEG) with mean molar 
weights of 1000 to 20000: ethoxylated nonylphenols having from 16 to 50 ethylene oxide units: 
ethoxylated fatty alcohols in which the alcohol contains from 12 to 20 carbon atoms and In 
which there are 15 to 80 ethylene oxide units; fatty alcohols; fatty acids; and mono- and di- and 

30 triglycerides of fatty acids. Examples of film-forming coating materials suitable for application 
by fluid bed techniques are given in GB 1 483591. Liquid enzyme preparations may, for in- 
stance, be stabilized by adding a polyol such as propylene glycol, a sugar or sugar alcohol, 
lactic acid or boric acid according to established methods. Protected enzymes may be pre- 
pared according to the method disclosed in EP 238,216. 

35 The detergent composition of the invention may be in any convenient form. e.g. a bar, 

a tablet, a powder, a granule, a paste or a liquid. A liquid detergent may be aqueous, typically 
containing up to 70 % water and 0-30 % organic solvent, or non-aqueous. 
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The detergent composition comprises one or more surfactants, which may be non- 
ionic i nctuding s emi-polar a rul/or a nionic a nd/or cationic a nd/or z witterionic. The s urfactants 
are typically present at a level of from 0.1 % to 60% by weight. 

When Included therein the detergent will usually contain from about 1% to about 40% 
5 of an anionic surfactant such as linear alkylbenzenesulfonate, alpha-olefinsulfonate. alky! sul- 
fate (fatty alcohol sulfate), alcohol ethoxysulfate. secondary alkanesulfonate, alpha-sulfo fatty 
acid methyl ester, alkyi- or alkenylsuccrnic acid or soap. 

When included therein the detergent will usually contain from about 0.2% to about 
40% of a non-ionic surfactant such as alcohol ethoxyiate, nonylphenol ethoxylate, alkylpolygly- 
1 0 coside, a Ikyldimethytamineoxide. e thoxylated fatty a cid m onoethanoiamide, fatty a dd m ono- 
ethanolamide, polyhydroxy alkyI fatty acid amide, or N-acyl N-aikyI derivatives of glucosamine 
("glucamides*). 

The detergent may contain 0-65 % of a detergent builder or complexing agent such as 
zeolite, diphosphate, triphosphate, phosphonate, carbonate, citrate, nitrilotriacetic acid, ethyl- 

15 enediaminetetraacetic acid, diethylenetriaminepentaacetic acid, alkyi- or alkenylsuccinic acid, 
soluble silicates or layered silicates (e.g. SKS-6 from Hoechst). 

The detergent may comprise one or wore polymers. Examples are carboxymethylcel- 
lulose, poly(vinylpyrrolldone), poly (ethylene glycol), poly(vinyl alcohol), poly(vinylpyridine-N- 
oxide), poly(vinyilmidazole), polycarboxylates such as polyacrylates, maleic/acrylic add co- 

20 polymers and lauryl methacrylate/acrylic add copolymers. 

The detergent may contain a bleaching system which may comprise a H202 source 
such as perborate or percarbonate which may be combined with a peracid-forming bleach ac- 
tivator such as tetraacetylethylenediamine or nonanoyloxybenzenesulfonate. Alternatively, the 
bleaching system may comprise peroxyadds of e.g. the amide, imide. or suifone type. 

25 The enzyme(s) of the detergent composition of the invention may be stabilized using 

conventional stabilizing agents, e.g., a polyol such as propylene glycol or glycerol, a sugar or 
sugar alcohol, lactic acid, boric acid, or a boric acid derivative, e.g.. an aromatic borate ester, 
or a phenyl boronic acid derivative such as 4-formylphenyl boronic actd, and the composition 
may be formulated as described in e.g. WO 92/19709 and WO 92/19708. 

30 The detergent may also contain other conventional detergent ingredients such as e.g. 

fabric conditioners including days, foam boosters, suds suppressors, anti-conroslon agents, 
soil-suspending agents, anti-soil redeposition agents, dyes, bacteriddes. optical brighteners, 
hydrotropes, tarnish inhibitors, or perfumes. 

It Is at present contemplated that in the detergent compositions any enzyme, in par- 

35 ticular the enzyme of the invention, may be added in an amount corresponding to 0.01-100 mg 
of enzyme protein per liter of wash liqour, preferably 0.05-5 mg of enzyme protein per liter of 
wash Itqour, in particular 0.1-1 mg of enzyme protein per liter of wash liqour. 



36 



1 0274.01 0-OK 



The enzyme of the invention nnay additionally be incorporated in the detergent formu- 
lations disclosed in WO 97/07202 which is hereby incorporated as reference. 

EXAMPLES 

Example 1: Isolation of DNA encoding GH-61 polypeptides from Thelavia terrestris 
5 Myoetium of a T. terrestris is grown on MEX-1 to induce proteins that respond to com- 

plex cellulosic medias. After 4 days at 37 ''C, the mycelium Is harvested by filtration though 
Whatman 1MM filter paper. Mycelium is frozen in liquid nitrogen and stored until further use. 
RNA isolation is performed according to the protocol of Chomczyniski and Saochi, 1987 (Ana- 
lytical Biochemistry 162: 156-159). The Poly(A) Quik mRNA isolation kit is used to purify polyA 

10 enriched RNA for cDNA production (Stratagene USA). Production of a cDNA library is 
achieved according to the SMART cDNA library construction kit (Clontech USA). Sfil restricted 
double strand cDNAs are cloned into Lambda TriptEx vector and the plasmid containing colo- 
nies is recovered by mass excision according to the SMART protocol. 

Individual plasmkjs containing cDNAs are prepared using existing 96 well silica based 

15 plasmid preparation systems such as Qiagen Qia-turfoo 96. Qiagen corp. USA). Once plasmid 
template is prepared, a!! plasmids are sequenced with the vector fonvard primer using conven- 
tional sequencing methods and equipment. 

identification of GH61 Expressed Sequence Tags (ESTs) are achieved by searching 
the entire non redundant protein database (for example. SWALL) with a program such as 

20 BlastP (Altschul. S.F., Gish. W.. Miller, W., Myers. E.W. & Lipman. DJ. (1990) "Basic local 
alignment search tool." J. Mol. Bbl. 215:403-410. Gish. W. & States, D.J. (1993) "Identification 
of protein coding regions by database similarity search." Nature Genet 3:266-272. Madden. 
T.L.. Tatusov, R.L. & Zhang, J. (1996) "Applications of network BLAST server" Meth. Enzymol, 
266:131-141. Altschul, S.F.. Madden. T.L.. SchMer. A.A., Zhang, J., Zhang. Z.. Miller. W. & 

25 Lipman, D.J. (1997) "Gapped BLAST and PSI-BLAST: a new generation of protein database 
search programs." Nucleic Acids Res. 25:3389-3402. Zhang, J. & Madden, T.L. (1997) "Pow- 
erBLAST: A new network BLAST application for interactive or automated sequence analysis 
and annotation." Genome Res. 7:649-656. 

Sequences with similarities to existing GH-61 family sequences are Identified based on 

30 a blast probability score matrix. 

A clone containing SEQ ID NO: 1 was selected for further analysis 

Example 2: Isolation of DNA encoding GH-61 polypeptides from Coprinus clnereus 

The procedure of example 1 was repeated for a strain of Humicola insoiens, A clone 
containing SEQ ID NO: 3 was selected for further analysis. 
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Example 3: Isolation of DNA encoding GH-61 polypeptides from Humlcola Insolens 

The procedure of example 1 was repeated for a strain of Coprinus cinereus for exam- 
ple Coprinius dnerus (CBS394.65) obtainable from the CBS. A clone containing SEQ ID NO: 5 
was selected for further analysis. 

5 Example 4: Construction of an Aspergillus expression vector for GH61 DNA sequences 
SEQ ID NO: 3 was amplified in the following manner: 1 microliter of cDNA (approximately 10 
nanograms of DNA) was used as template in a PCR reaction with the two primers A and B. 

Primer A: 5'-6CGGAATTCATCATGAGGCCCTTCTCCCTC-y (SEQ ID NO:7) 
10 Primer B: S-ATTTGCGGCCGCTTCCCGTCATCCTCTAAGGC-y (SEQID NO:8) 

SEQ ID NO: 1 and 5 can be amplified in a similar manner using the primers: 

For SEQ ID NO: 1: 

15 Primer A: 5 -GCGGAATTCATGAAGCTCACCACCTCGGT-3' (SEQ ID N0:9) 

Primer B: 5'-ATTTGCGGCCGCGCAGCCAACCAACCTGGAAT.3' (SEQ ID NO:10) 

For SEQ ID NO: 5: 

Primer A: 5'-GCGGAATTCACAATGAAGGTCTTCGCATAC-3' (SEQ ID N0:1 1 ) 

20 primer B: 5'-ATTTGCGGCCGCACGATGCGATGAGCATTTAT-3* (SEQ ID NO: 1 2) 

5 pmoles of each primer was used in a SO microliter reaction volume. The Qiagen 
ProofStart high fidelity DNA polymerase and buffer were used according to the manufacturer's 
instructions (Qiagen, USA). Briefly, the reaction was placed in a thermal cycler (MJ Research, 

25 Dyad, USA) and cycled under the following reaction conditions: An initial denaturation of 5 
minutes at 95 degrees Celsius, 25 cycles of the following: 94 degrees-30 seconds, 55 de- 
grees-30 seconds, 72 degrees 2 minutes. A final extension temperature of 72 degrees for 10 
minutes was then used. Ailquots of the PCR reaction were separated on a 1% agarose gel. 
One distinct band was seen: The size of this band (1.1 kb) corresponded well with the pre- 

30 dieted size of the open reading frame. 

The fragment was digested with EcoRI and NotI which cut in the overhangs introduced 
by the PCR primers. The digested fragments were Isolated and cloned into pMStr54, an As- 
pergillus expression plasmid based on the plasmid pCaHj527 (see the examples of interna- 
tional patent application WO 00/70064) constructed as described in example 7 of WO 

35 02/12472. Plasmid DNA was isolated from colonies of the cloning experiment. The colonies 
were sequenced with vector primers PN/^I (5-GTT TCC AAC TCA ATT TAC CTC-3') and 
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MHasSNotI (5 -TTG CCC TCA TCC CCA TCC TTT-3') which prime in opposite directions Into 
the piasmid Insert. It was determined that no errors were introduced In any of the insert se- 
quences as a result of PCR. 

Example 5: Expression of SEQ ID NOS: 1, 3 and 5 In Aspergillus: 

5 SEQ ID NOS: 1. 3 and 5 were transformed into Aspergiiius oryzae strain JAL355 (dis- 

closed in intemational patent application WO 01/98484A1). Transfonmants of SEQ ID NOS: 1 .3 
and 5 were re-isolated twice under selective and noninducing conditions on Cove minimal 
plates with 1M sucrose as a carbon source and lOmM nitrate. (See recipe of the manufacturer) 
To test expression of SEQ ID NOS: 1,3 and S, transformants were grown for 3 days and 4 
10 days at 30 degrees Celsius in tubes with 10 ml YPM (2% peptone, 1% yeast extract, 2% mal- 
tose). Supernatants were run on NuPage 10% Bis-Tris SDS gels (Invitrogen) as recommended 
by the manufacturer. All Aspergillus isolates grew well even when induced for the expression 
of SEQ ID NOS: 1.3 and 5. 

Example 6: Purification of SEQ ID NO: 2 from expression of SEQ ID NOS: 1 in Aspergll- 
15 lus: 

A culture supematant from a fermentation of the Aspergillus oryzae strain expressing 
SEQ ID NO: 1 was filtered through a 0.22 pm filter to remove the mycelia. 350 ml of the Altered 
supernatant was diluted to 1450 ml with water and pH adjusted to 7.5 resulting in a conductiv- 
ity of 1.8 mS/cm. This solution was loaded onto a 50 ml Q-Sepharose anion exchange column 

20 equilibrated with 25 ml\^ Tris pH 7.5. The column was washed with at>out 15 column volumes 
of 25 mM Tris pH 7.5 and bound proteins were eluted with a NaCI gradient Increasing linearly 
from 0 to 0.5 M over 20 column volumes. From SDS-PA6E it was seen that a polypeptide with 
a molecular weight of about 29 kDa was eluted during the wash with 25 mM Tris, pH 7.5. Frac- 
tions containing the 29 kDa polypeptide were pooled and concentrated on an Amicon uitrafil- 

25 tration device with a 6 kDa cut off filter (Dow. GR 81 PP). The concentrated pool was at least 
95% pure as estimated from SDS-PAGE and N-tenminai sequencing of the polypeptide gave 
the sequence corresponding to SEQ ID NO: 2 (HYTFPQTDtNGQLSGE). 

Example 7: Purification of SEQ ID NO: 4 from expression of SEQ ID NOS: 3 in Aspergil- 
lus: 

30 A culture supematant from a fermentation of the Aspergillus oryzae strain expressing 

SEQ ID NO: 3 was filtered through a 0.22 pm filter to remove the mycelia. 500 ml of the filtered 
supematant was ultrafiltrated In a Filtron device with a 10 kDa cut off membrane. pH was ad- 
justed to 9.5 and the filtrate loaded onto a 50 ml Q-Sepharose anion exchange column equili- 
brated with 25 mM glycine pH 9.5. The column was washed with at)out two column volumes of 
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25 mM glycine pH 9.5 and bound proteins were eiuted with a NaCI gradient increasing linearly 
from 0 to 0.5 M over 20 column volumes. From SDS-PAGE It was seen that a polypeptide with 
a molecular weight of about 32 l<Da was eiuted during the wash with 25 mM glycine pH 9.5. 
Fractions containing the 32 kDa polypeptide were pooled and concentrated on an Amicon ul- 
5 trafiltration device with a 20 kDa cut off filter (DOS, GR 61PP). Purity of the concentrated pool 
was around 85% as estimated from SDS-PAGE and concentration around 2.4 mg/ml esti- 
mated from absorbance at 280 nm and theoretical extinction coefficient. 

Example 8: Testing activity of SEQ ID NOS: 2, 4 and 6 

An oat AZCL-xylan, a birchwood AZCL-xytan and a wheat AZCL arabinoxylan sub- 

10 strate suspension were prepared by suspending 2 milligram per milJIIitre of the substrate in a 
Nap buffer. pH 7 containing 0.0225 % w/w Brij. 

Activity of the purified SEQ ID NOS 2, 4 and 6 were tested by mixing 500 microliter 
substrate suspension with 100 microliter of a solution containing isolated SEQ ID NOS 2, 4 or 
6. This mixture was incubated at 37 ''C, wherein after undigested substrate was sedimentated 

15 by centrifugation and digested substrate was evaluated by measuring absorbance of the su- 
pematant at 590 nanometers. A blank value determined by replacing for one sample the en- 
zyme solution with a buffer was subtracted the absorbance measurements. 



Results: 



GH-61 polypeptide 


Absorbance for 
Oat AZCL-xylan 


Absorbance for 
Birchwood AZCL- 
xylan 


Absorbance for 
Wheat AZCL- 
arabinoxylan 


SEQ ID NO: 2 


0,0172 


0.1077 


0.1296 


SEQ ID NO: 4 


0.0798 


0.4017 


0.8128 


SEQ ID NO: 6 


0.0536 


0.2064 


0.6101 



20 

These results showed for the first time GH-61 polypeptides having xylanase activity. 



Example 9: Testing effect of SEQ ID NOS: 2 and 4 on quality of baked bread. 

Bread was baked according to the sponge & dough method from 2 kg of flour. Ca 
25 propionate was added to the recipe. 



Enzymes were dosed according to the table below: 





1 


2 


3 


4 


5 


6 


7 


8 
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Novamyl 
MANU/kg 


Ann 


Ann 


Ann 


Ann 






40U 




Shearzyme 
FXU/kg 
















200 


SEQ ID NO:4 
H. insoiens 
mg/kg 




4 


d 












SEQ ID NO:2 
7. terrestris 
mg/kg 








1 


2 


5 







Bread was packed in plastic bags and stored at room temperature until analysis. 



Texture and NMR were measured on day 7, 14 and 21 and a small sensory evaluation 
was performed on day 21 . 

Data: 

Texture measurements 

Firmness and elastidty data are shown in figure 1 & 2. 
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Time (Days) 



Ftg I Change in Firmness during storage 




44 -I : 1 1 i i i i 

6 8 10 12 14 16 18 20 22 

Time (Days) 

5 Fig 2 Change in Elasticity during storage 
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The GH 61 xylanases show interesting results compared to the lead combination - No- 
vamyl Shearzyme. Espedaliy the giycosylhydrolase from T. terrestris i n combination with 
Novamyl shows effect on softness and elasticity comparable to or better than Novamyl alone. 

5 Water characteristics & sensory evaluation 

Amount and mobility of water was measured by NMR. The amount and mobility of the 
free water, believed to correlate with the moist feeling of bread, Is shown in figure 3. 

7700 T 1 




Days 

Fig. J Change in mobUity with time (NMR) 

The two Novamyl references show very different results in the measurement of water 
mobility by NMR, The "best" of the Novamyl references show comparable effect to the T. ter- 
restris giycosylhydrolase in combination with Novamyl, however the second Novamyl refer- 
ence show significantly inferior results to these. 

Conclusion 

The GH-61 xylanases - especially from T. terrestris - show improvement in freshkeep- 
ing of bread - on softness, elasticity, perception of molstness and moisture retention capacity. 
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CLAIMS 



1. An isolated GH-61 polypeptide having xylanase activity. 

5 2. The xylanase of claim 1. comprising in it naturally occumng mature form consen/ed 
portions of a H at position 1. A or P at position 59, G at position 60, G at position 75, P or A at 
position 76. W or F at position 100, F or T at position 101, K or C at position 102, J or V or L at 
position 103. L or I or V or M at position 130, P at position 131. G and Xaa and Y at position 
137-139, L or V or I or M at position 140, L or V or I or M at position 141 , R at position 142, E 
10 or Q at positions 143*144, L or V or I at position 148. H or N at position 149, C at position 163 
and P and G and P at position 209-21 1 . 

3. The xylanase of claim 2 comprising the amino add sequence of the mature xylanases 
of SEQ ID NO:2» SEQ ID NO:4 or SEQ ID NO:6. an allelic variant thereof; or a fragment 

1 5 thereof that has the xylanase activity. 

4. The xylanase of claim 3 comprising an amino add sequence which has at least 65% 
identity to the amino acids of the mature xylanases of SEQ ID NO:2. SEQ ID NO:4 or SEQ ID 
NO:6. 

20 

5. The xylanase of daim 4 differing at the most by ten amino adds from amino acids of 
the mature xylanases of SEQ ID NO:2. SEQ ID NO:4 or SEQ ID NO:6 

6. The xylanase of claim 5 consisting of the amino acid of the mature xylanases of SEQ 
25 ID NO:2, SEQ ID NO:4 or SEQ ID NO:6. 

7. The xylanase of daim 2 comprising at least one substitution, deletion or insertion out- 
side the conserved portions of the amino add sequence. 

30 8. The xylanase of claim 2 comprising at least one sut>stitution inside the conserved por- 
tions. 

9. The xylanase of daim 7, wherein the number of substitutions, deletions or insertions is 
at the most 10. 

35 
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10. The xylanase of claim 1, wherein the xylanase is enooded by nucleotide sequences 
which hybridize under very low stringency conditions with a polynucleotide probe selected from 
the group consisting of 

(i) the complementary strand of nucleotides 52 to 699 of SEQ ID NO:1, 46 to 957 of SEQ 
ID NO:3 or 58 to 660 of SEQ ID NO:5. 

(ii) the complementary strand of the cDNA sequence contained in nucleotides 52 to 699 of 
SEQ ID NO:1. 46 to 957 of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5 

(iii) the complementary strand of nucleotides 46 to 857 of SEQ ID NO:3. 

(iv) the complementary strand of nucleotides 52 to 300 of SEQ ID NO:1, 46 to 501 of SEQ 
ID NO:3 or 58 to 300 of SEQ ID NO:5 or 

(V) the complementary strand of nucleotides 301 to 699 of SEQ ID NO:1, 502 to 957 of 
SEQ ID NO:3 or 301 to 660 of SEQ ID NO:5. 

11. The xylanase of claim 10. wherein the xylanase is encoded by a polynucleotide com- 
prising the nucleotide sequence of nucleotides 52 to 699 of SEQ ID NO:1, 46 to 957 of SEQ ID 
NO:3 or 58 to 660 of SEQ ID NO:5 or sequences differing from 52 to 699 of SEQ ID NO:1. 46 
to 957 of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5 by virtue of the degeneracy of the genetic 
code. 

12. The xylanase of claim 11, wherein the xylanase is encoded by a polynucleotide con- 
sisting of nucleotide sequence of nucleotides 52 to 699 of SEQ ID NO:1, 46 to 957 of SEQ ID 
NO:3 or 58 to 660 of SEQ ID NO:5 or sequences differing from 52 to 699 of SEQ ID NO:1. 46 
to 957 of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5 by virtue of the degeneracy of the genetic 
code. 

1 3. The xylanase of claim 1 . obtained from a fungus. 

14. A polynucleotide comprising a nucleotide sequence encoding a xylanase of claims 1 to 
13. 

15. The polynucleotide of claim 14, wherein the nucleotide sequence comprise a subse- 
quence which encode: 

(a) a fragment of SEQ ID NO:2 that exhibit xylanase activity and contains the conserved 

portions. 

(b) a fragment of SEQ ID NO:4 that exhibit xylanase activity and contains the conserved 
portions or 
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(c) a fragment of SEQ ID N0:6 that exhibit xylanase activity and contains the conserved 
portions. 



16. The polynucleotide of claim 15, wherein the nucleotide sequence comprise a subse- 
5 quence of: 

(a) SEQ ID N0:1 which encode fragments of SEQ ID NO:2 that exhibit xylanase activity 
and contains the conserved portions. 

(b) SEQ ID NO:3 which encode fragments of SEQ ID NO:4 that exhibit xylanase activity 
and contains the consented portions or 

10 (c) SEQ ID NO:5 which encode fragments of SEQ ID NO:6 that exhibit xylanase activity 
and contains the conserved portions. 

17. The polynucleotide of claim 16. comprising a nucleotide sequence encoding the amino 
acid sequence of SEQ ID NO:2. SEQ ID NO:4 or SEQ ID NO:6 or the mature xylanase thereof. 

15 

18. The polynucleotide of daim 17, comprising the nucleotide sequence of the mature xy- 
lanase coding region of SEQ ID NO:1 , SEQ ID NO:3 or SEQ ID NO:5 



19. The polynucleotide of claim 18, consisting of the nucleotide sequence of the mature 
20 xylanase coding region of SEQ ID N0:1 , SEQ ID NO:3 or SEQ ID NO:5 

20. A nucleic acid construct comprising a nucleotide sequence of claims 14-19 linked to 
one or more control sequences that direct the production of the polypeptide in a suitable host. 

25 21 . A recombinant expression vector comprising the nucleic add construct defined in daim 
20. 



22. A recombinant host cell comprising the nucleic acid construct of daim 20. 
30 23. A method for produdng the xylanase of claims 1 to 13 comprising: 

(a) cultivating a strain, which in its wild-type form is capable of produdng the xylanase, to 
produce the xylanase; and 

(b) recovering the xylanase. 
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24. A method for producing the xylanase of claims 1 to 13, the method comprising: 
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(a) cultivating a recombinant host cell as defined in claim 22 under conditions conducive 
for production of the xylanase; and 

(b) recovering the xylanase. 

5 

25. A transgenic plant comprising a nucleotide sequence of the invention and capable of 
expressing the xylanase of claims 1 to 13. 

26. A composition comprising the xylanase of claims 1 to 13. 

10 

27. The composition of claim 26. further comprising a surfactant. 

28. The composition of claim 26 further comprising a cereal product. 

15 29. Use of a xylanase of claims 1 to 1 3 for hydrolyzing a xyiolosic substrate. 

30. Use of a xylanase of claims 1 to 13 for preparing an animal feed composition 

31 . Use of a xylanase of claims 1 to 13 for preparing a dough composition. 

20 

32. Use of a xylanase of claims 1 to 1 3 for preparing alcohol fuel. 

33. Use of a xylanase of claims 1 to 1 3 for treatment of brewing wort. 

25 34. Use of a xylanase of claims 1 to 1 3 for preparation of fruit or vegetable Juice. 

35. Use of a xylanase of claints 1 to 1 3 for cleaning a fabric 

36. Use of a xylanase of claims 1 to 13 for treatment of pulp. 
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ABSTRACT 

The present invention relates to an isolated GH-61 polypeptide having xylanase activity. 
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10274-100. ST25 
SEQUENCE LISTING 



<110> 
<120> 
<130> 
<160> 

<170> 

<210> 
<211> 
<212> 
<213> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



Novozymes A/s 
GH-61 xylanases 
102 74/Novozymes 
12 

Patentin version 

1 

699 
ONA 

T. terrestris 
eptide 



Modtaget 
22 jULI 2003 
PVS 



3.1 



mat^peptide 
(52).. O 



CDS 
(1). 



C699) 



<400> 1 

atg aag etc acc acc teg 
Met Lys Leu Thr Thr Ser 
-15 

gcc cac tac acc ttc ccg 
Ala His Tyr Thr Phe Pro 
-11 5 

gaa tgg gtg acg ate cgt 
Glu Trp vaT Thr lie Arg 
20 

gtc acc gac gtc acg teg 
val Thr Asp val Thr Ser 
35 

ggc acc cee geg ece eag 
Giy Thr Pro Ala Pro Gin 
SO 

ace ttc acc gtc gac cca 
Thr Phe Thr val Asp Pro 
65 

tac atg gee aag geg ccg 
Tyr Met Ala Lys Ala Pro 
80 85 

ace gge aac gtc tgg ttc 
Thr Gly Asn val Trp Phe 
100 

ggc acc age aae ate acc 
Gly Thr Ser Asn lie Thr 
115 



gtc gee 
val Ala 
-10 

eag acg 
Gin Thr 



gag acg 
Glu Thr 



gac eag 
Asp Gin 



ate gee 
lie Ala 
55 

age ate 
Ser lie 
70 

teg gge 
Ser Gly 



aag ate 
Lys He 



tgg cet 
Trp Pro 



ctg ctt gee geg 
Leu Leu Ala Ala 



geg ggc 
Ala Gly 



gac ate 
Asp lie 



acg aae 
Thr Asn 
25 

ate cgc 
lie Arg 
40 

acg gtg 
Thr VaT 



aae ggc 
Asn Gly 
10 

cac tac 
His Tyr 



tgc tac 
cys Tyr 



eag gee 
Gin Ala 



eag cac ccg ggc 
Gin His Pro Gly 
75 

eag acc gee gcc 
Gin Thr Ala Ala 
90 

tac gag gac ggc 
Tyr Glu Asp Gly 
105 

age age gge aaa 
Ser ser Gly Lys 
120 
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eag etc 
Gin Leu 



teg cac 
ser His 



gag etc 
Glu Leu 
45 

ggc ggc 
Gly Gly 
60 

ccg ctg 
Pro Leu 



ace ttc 
Thr Phe 



ccg tec 
Pro Ser 



ace gaa 
Thr Glu 
125 



geg caa 
Ala Gin 



tec ggc 
ser Gly 
15 

ggg ccg 
Gly Pro 
30 

aac ccg 
Asn Pro 



acc gtg 
Thr val 



eag ttc 
Gin Phe 



eag gge 
Gin Gly 
95 

gge etc 
Gly Leu 
110 

gtc age 
val ser 
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144 



192 



240 



288 



336 



384 
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480 



528 



576 



gtc aag ate ccc teg tgc ate gcg ceg ggc gac tac etc ctg cgc gtg 
val Lys He Pro Ser cys He Ala Pro Giy Asp Tyr teu Leu Arg vaT 
130 135 140 

gag eac ate gcg ctg cae age gee age ace gtc ggc ggc gee cag ttc 
Glu ms He Ala Leu His Ser Ala Ser Thr val cTy GTy Ala Gin Phe 
145 150 155 

tac etc gcg tgc gee cag ctg ace gtc ace ggc ggc ace ggc ace etc 
Tyr Leu Ala Cys Ala Gin Leu Thr val Thr Giy Gly Thr Giy Thr Leu 
160 165 170 175 

aac aeg ggc gag etc gtc gee ttc ccc ggc gee tac age gee ace gac 624 
Asn Thr Gly Glu Leu val Ala Phe Pro Gly Ala Tyr ser Ala Thr Asp 
180 185 190 

ceg ggc ate etc ttc cag ctg tac tgg ccc ate ceg ace age tac ace 
Pro Gly lie Leu Phe Gin Leu Tyr Trp Pro He Pro Thr Ser Tyr Thr 
195 200 205 



672 



aac ccc ggc ceg gcg ccc gtt age tgc 699 
Asn Pro Gly Pro Ala Pro val Ser Cys 
210 215 

<210> 2 
<211> 233 
<212> PRT 

<213> T. terrestris 
<400> 2 

Met Lys Leu Thr Thr Ser Val Ala Leu Leu Ala Ala Ala Gly Ala Gin 
-15 -10 -5 

Ala His Tyr Thr Phe Pro Gin Thr Asp lie Asn Gly Gin Leu Ser Gly 
-11 5 10 15 

Glu Trp Val Thr lie Arg Glu Thr Thr Asn His Tyr Ser His Gly Pro 
20 25 30 

val Thr Asp val Thr Ser Asp Gin He Arg cys Tyr Glu Leu Asn Pro 
35 40 45 

Gly Thr Pro Ala Pro Gin lie Ala Thr val Gin Ala Gly Gly Thr val 
50 55 60 

Thr Phe Thr Val Asp Pro ser lie Gin His Pro Gly Pro Leu Gin phe 
65 70 75 

Tyr Met Ala Lys Ala Pro ser Gly Gin Thr Ala Ala Thr Phe Gin Gly 
80 85 90 95 

Thr Gly Asn val Trp Phe Lys He Tyr Glu Asp Gly Pro Ser Gly Leu 
100 105 110 

Gly Thr Ser Asn lie Thr Trp Pro Ser Ser Gly Lys Thr Glu val Ser 
115 120 125 
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val Lys lie Pro ser cys lie Ala Pro Gly Asp Tyr Leu Leu Arg val 
130 135 140 



Glu His lie Ala Leu His Ser Ala Ser Thr val Gly Gly Ala Gin Phe 
145 150 155 



Tyr Leu Ala Cys Ala Gin Leu Thr val Thr Gly Gly Thr Gly Thr Leu 
160 165 170 175 



Asn Thr Gly Glu Leu Val Ala Phe Pro Gly Ala Tyr Ser Ala Thr Asp 
180 185 190 



Pro Gly He Leu Phe Gin Leu Tyr Trp Pro lie Pro Thr ser Tyr Thr 
195 200 205 



Asn Pro Gly Pro Ala Pro val ser cys 
210 215 



<210> 3 

<211> 957 

<212> DNA 

<213> Humicola insol ens 
<220> 

<221> CDS 

<222> (1)..(957) 

<223> 

<220> 



<220> 

<221> mat_peptide 

<222> (46).. O 
<223> 

<220> 

<221> misc^feature 

<222> (858)., (957) 

<223> CBMl domain 



<400> 3 

atg agg ccc ttc tec etc gtc gcg ctg gcg acg gee gtc age ggc eac 
Met Arg pro Phe Ser Leu val Ala Leu Ala Thr Ala val Ser Gly His 
-15 -10 -5 -1 1 

gee ate ttc eag egc gtg teg gtt aac got gtc gac eaa ggc cag etc 
Ala lie Phe Gin Arg val ser val Asn Gly val Asp Gin Giy Gin Leu 
5 10 15 

aag ggc gtg egg get ccc teg age aac tac ccc att gag aac gtc aac 
Lys Gly val Arg Ala Pro Ser ser Asn Tyr Pro He Glu Asn val Asn 
20 25 30 

cac ccc gac ttt gee tgc aac ace aac ate egg cac cgc gac ggc ace 
His Pro Asp Phe Ala Cys Asn Thr Asn lie Arg His Arg Asp Gly Thr 



<221> 
<222> 
<223> 



sig_peptide 
(1)..(45) 



35 



40 



45 
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gtc ate aag ate ccc gcc ggc gcc ace gtc ggc gcc tgg tgg cag cac 

val lie Lys He Pro Ala Giy Ala Thr val Giy Ala Trp Trp Gin His 

50 55 60 65 

gag ate ggc ggg ccc teg ttc ccc ggt gat ccg gat aac ccg ate get 

Glu lie Giy Giy Pro ser Phe Pro Giy Asp Pro Asp Asn Pro lie Ala 
70 75 80 



ggc ggc gcc cag ttc tac atg gag tgc gcc cag ate gag gtc ace ggc 
Giy Giy Ala Gin Phe Tyr Met Glu cys Ala Gin He Glu val Thr Giy 
165 170 175 



<210> 4 
<211> 319 
<212> PRT 
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240 
288 



gee teg cac aag ggc ccc ate eaa gtc tat etc gcc aag gtc gac aac 336 
Ala Ser His Lys Giy Pro lie Gin val Tvr Leu Ala Lys val Asp Asn 
85 90 95 



gcc get acc gcc tet ccc aac ggc ctg egg tgg ttc aag att gee gag 384 
Ala Ala Thr Ala Ser Pro Asn Giy Leu Arg Trp Phe Lys He Ala Glu 
100 105 110 



aag ggc ctg teg ggc ggc gtc tgg gcc gtc gac gag atg ate cgc aac 432 
Lys Giy Leu ser Giy Giy val Trp Ala Val Asp Glu Met He Arg Asn 
115 120 125 

aac ggc tgg cac tac ttc acc atg ccg cag tgc ate gcg ccc ggc cac 480 
Asn Giy Trp His Tyr Phe Thr Met Pro Gin Cys He Ala Pro Giy His 
130 135 140 145 

tac ctg atg cgc gtc gag ctg ctg gcg ctg cac teg gcc age ttc ccc 528 
Tyr Leu Met Arg val Glu Leu Leu Ala Leu His Ser Ala Ser Phe Pro 
150 155 160 
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teg ggc aac ttc teg ccc tec gag aeg gtc age ttc ccc ggc gee tac 624 
ser Giy Asn Phe Ser pro ser Glu Thr val Ser Phe Pro Giy Ala Tyr 
180 185 190 



ccg gcc aac cac ccg ggt ate gtc gtc age ate tac gac gee cag ggt 672 
pro Ala Asn His Pro Giy He val val Ser He Tyr Asp Ala Gin Giy 
195 200 205 



aac gcc aac aac ggc ggg cgc gag tac cag ate ccc ggg ccg egg ccg 720 

Asn Ala Asn Asn Giy Giy Arg Glu Tyr Gin He Pro Giy Pro Arg Pro 

210 215 220 225 

ate acc tgc tee ggc ggt gga age aac aat ggt ggc ggg aac aac aat 768 

He Thr Cys ser Giy Giy Giy Ser Asn Asn Giy Giy Giy Asn Asn Asn 
230 235 240 

ggt ggt gga aac aat aac ggc ggc ggc ggc aac aac aac ggc ggt ggg 

Giy Giy Giy Asn Asn Asn Giy Giy Giy Giy Asn Asn Asn Giy Giy Giy 
245 250 255 



816 



aac aac aac ggt ggc ggt aac acc ggt ggc ggc teg gcg ccg etc tgg 864 
Asn Asn Asn Giy Giy Giy Asn Thr Giy Giy Giy Ser Ala Pro Leu Trp 
260 265 270 

ggc cag tgc ggc ggc aat ggg tat tec ggc ccg aeg act tgc gcc gag 912 
Giy Gin cys Giy Giy Asn Giy Tyr Ser Giy Pro Thr Thr Cys Ala Glu 
275 280 285 



ggt act tgc aag aag cag aat gac tgg tac teg cag tgt aeg ect 957 
Giy Thr cys Lys Lys Gin Asn Asp Trp Tyr ser Gin cys Thr Pro 
290 295 300 
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<213> HumicoTa insolens 
<400> 4 

Met Arg Pro Phe ser Leu val Ala Leu Ala Thr Ala val ser Gly His 
-15 -10 -5 -1 1 

Ala He Phe Gin Arg val Ser val Asn Gly Val Asp Gin Gly Gin Leu 
5 10 15 

Lys Gly val Arg Ala Pro Ser Ser Asn Tyr Pro He Glu Asn val Asn 
20 25 30 

His Pro Asp Phe Ala Cys Asn Thr Asn He Arg His Arg Asp Gly Thr 
35 40 45 

val He Lys He Pro Ala Gly Ala Thr Val Gly Ala Trp Trp Gin His 
50 5S 60 65 

Glu He Gly Gly Pro ser Phe Pro Gly Asp Pro Asp Asn Pro He Ala 
70 75 80 

Ala ser His Lys Gly Pro lie Gin val Tyr Leu Ala Lys val Asp Asn 
85 90 95 

Ala Ala Thr Ala Ser Pro Asn Gly Leu Arg Trp Phe Lys lie Ala Glu 
100 105 110 

Lys Gly Leu ser Gly Gly val Trp Ala val Asp Glu Met lie Arg Asn 
115 120 125 

Asn Gly Trp His Tyr Phe Thr Met Pro Gin cys He Ala Pro Gly His 
130 135 140 145 

Tyr Leu Met Arg val Glu Leu Leu Ala Leu His Ser Ala Ser Phe Pro 
150 155 160 

Gly Gly Ala Gin Phe Tyr Met Glu Cys Ala Gin He Glu val Thr Gly 
165 170 175 

ser Gly Asn Phe Ser pro Ser Glu Thr val Ser Phe Pro Gly Ala Tyr 
180 185 190 

Pro Ala Asn His Pro Gly He val val ser He Tyr Asp Ala Gin Gly 
195 200 205 

Asn Ala Asn Asn Gly Gly Arg Glu Tyr Gin lie Pro Gly pro Arg Pro 
210 215 220 ^ 225 

lie Thr cys Ser Gly Gly Gly Ser Asn Asn Gly Gly Gly Asn Asn Asn 
230 235 240 
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Gly G1y Gly Asn Asn Asn Gly Gly Gly Gly Asn Asn Asn Gly Gly Gly 
245 250 255 



Asn Asn Asn Gly Gly Gly Asn Thr Gly Gly Gly Ser Ala Pro Leu Trp 
260 265 270 

Gly Gin Cys Gly Gly Asn Gly Tyr Ser Gly Pro Thr Thr Cys Ala Glu 
275 280 285 

Gly Thr Cys Lys Lys Gin Asn Asp Trp Tyr Ser Gin Cys Thr Pro 
290 295 300 

<210> 5 

<211> 660 

<212> DNA 

<213> coprinus cinereus 

<220> 

<221> CDS 

<222> (1) . • (660) 

<223> 

<220> 

<221> S'ig.peptide 

<222> Cl5..C57) 
<223> 

<220> 

<221> mat^peptide 

<222> C58)-.0 
<223> 

<400> 5 

atg aag gtc ttc gca tac gtc gcc etc etc gcc gcc get gcc cag teg 48 

Met Lys val Phe Ala Tyr val Ala Leu Leu Ala Ala Ala Ala Gin ser 
-15 -10 -5 

get tct get cac tac ate tgg ace acc etc acc gcc ggc ggc cag acc 96 
Ala ser Ala His Tyr lie Trp Thr Thr Leu Thr Ala Gly Gly Gin Thr 
-11 5 10 

acc tec gee gtc ate cgc cag ccc etc aac aac tec ccc gtc gag gae 144 
Thr ser Ala val lie Arg Gin Pro Leu Asn Asn ser Pro val Glu Asp 
15 20 25 

gtc tec tec ccc cac atg cgc tgc aac gtc aac ccc atg ccc gcc teg 192 
val ser Ser Pro His Met Arg cys Asn val Asn Pro Met Pro Ala Ser 
30 35 40 45 

cag acc etc aac gtt cag gcc ggc tec age gtc acc ttc cgc etc gae 240 
Gin Thr Leu Asn val Gin Ala Gly ser ser val Thr Phe Arg Leu Asp 
50 55 60 



aac ace etc tac cac ccc ggc ccc gee gcc ate tac etc ggc caa gtc 288 
Asn Thr Leu Tyr His Pro Gly Pro Ala Ala lie Tyr Leu Gly Gin val 
65 70 75 



ccc gcc ggc cag ace gcc gee tct tgg gae gga age got get aac tgg 336 
Pro Ala Gly Gin Thr Ala Ala Ser Trp Asp Gly Ser Gly Ala Asn Trp 
80 85 90 

ttc aag ate gae gag ttc ggc gee cag ttc aac ccc ttc cgc ttc ate 384 
Phe Lys lie Asp Glu Phe Gly Ala Gin Phe Asn Pro Phe Arg Phe lie 
95 100 105 
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ccc gac gac cag age cag etc tec acc acc ate ece tee aac act ecc 432 

Pro Asp Giy Gin Ser Gin Leu Ser Thr Thr lie Pro Ser Asn Thr Pro 
110 115 120 125 

age ggc gag tac etc etc cgc ate gag cac ate got etc cae gte gee 480 

Ser Gly Glu Tyr Leu Leu Arg lie Glu His lie Giy Leu His val Ala 
130 135 140 



ggc get ece caa tac tac ate tee tgc gee caa ate ege gtt aat ggc 528 
Gly Ala Pro Gin TVr TVr lie Ser Cys Ala Gin lie Arg val Asn Gly 
145 150 155 



ggt gga ggc goc aac cca ccc aag gte tec ate ccc gga tac gte teg 
Gly Gly Gly Gly Asn Pro Pro Lys val ser lie Pro Gly Tyr val Ser 
160 165 170 

agg aac gac ccc ggt etc ace gte aac ate cae tgg ccc ate ccc acc 
Arg Asn Asp Pro Gly Leu Thr val Asn He His Trp Pro He Pro Thr 
175 180 185 



<210> 6 
<211> 220 
<212> PRT 

<213> coprinus cinereus 
<400> 6 

Met Lys Val Phe Ala Tyr val Ala Leu Leu Ala Ala Ala Ala Gin Ser 
-15 -10 -5 

Ala ser Ala His Tyr lie Trp Thr Thr Leu Thr Ala Gly Gly Gin Thr 
-11 5 10 

Thr ser Ala val lie Arg Gin Pro Leu Asn Asn Ser Pro val Glu Asp 
15 20 25 

val Ser ser Pro His Met Arg cys Asn val Asn Pro Met pro Ala ser 
30 35 40 45 

Gin Thr Leu Asn val Gin Ala Gly Ser ser val Thr Phe Arg Leu Asp 
50 55 60 

Asn Thr Leu Tyr His Pro Gly Pro Ala Ala lie Tyr Leu Gly Gin Val 
65 70 75 

Pro Ala Gly Gin Thr Ala Ala Ser Trp Asp Gly Ser Gly Ala Asn Trp 
80 85 90 

Phe Lys He Asp Glu Phe Gly Ala Gin Phe Asn Pro Phe Arg Phe He 
95 100 105 

Pro Asp Gly Gin Ser Gin Leu Ser Thr Thr lie Pro Ser Asn Thr Pro 
110 115 120 125 
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576 
624 



tee tac ace gte ccc gga ece ege ccc tgg agg gga 660 
Ser Tyr Thr val Pro Gly Pro Arg Pro Trp Arg Gly 
190 195 200 
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Ser Gly Glu Tyr Leu Leu Arg lie Glu His lie Gly Leu His Val Ala 
130 135 140 

Gly Ala Pro Gin Tyr Tyr He Ser Cys Ala Gin lie Arg val Asn Gly 
145 150 155 

Gly Gly Gly Gly Asn Pro Pro Lys val Ser lie Pro Gly Tyr val ser 
160 165 170 

Arg Asn Asp Pro Gly Leu Thr val Asn lie His Trp Pro lie Pro Thr 
175 180 185 

Ser Tyr Thr val Pro Gly Pro Arg Pro Trp Arg Gly 
190 195 200 

<210> 7 
<211> 30 
<212> DNA 

<213> Primer A for SEQ ID NO: 3 
<400> 7 

gcggaattca tcatgaggcc cttctccctc 30 

<210> 8 
<211> 32 
<212> DNA 

<213> Primer B for seq id no: 3 

<400> 8 

atttgcggcc gcttcccgtc atcctctaag gc 32 

<210> 9 
<211> 29 
<212> DNA 

<213> primer A for SEQ 10 NO: 1 

<400> 9 

gcggaattca tgaagctcac cacctcggt 29 

<210> 10 
<211> 32 
<212> DNA 

<213> Primer B for SEQ ID NO: 1 
<400> 10 

atttgcggcc gcgcagccaa ccaacctgga at 32 

<210> 11 
<211> 30 
<212> DNA 

<213> Primer A for SEQ ID NO: 5 

<400> 11 

gcggaattca caatgaaggt cttcgcatac 30 



<210> 12 
<211> 32 
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<212> DNA 

<213> Primer B for SEQ 10 NO: 5 
<400> 12 

atttgcggcc gcacgatgcg atgagcattt at 
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